NIPS Proceedingsβ

Matteo Pirotta

4 Papers

  • Adaptive Batch Size for Safe Policy Gradients (2017)
  • Compatible Reward Inverse Reinforcement Learning (2017)
  • Regret Minimization in MDPs with Options without Prior Knowledge (2017)
  • Adaptive Step-Size for Policy Gradient Methods (2013)