NIPS Proceedingsβ

Matteo Pirotta

5 Papers

  • Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes (2018)
  • Adaptive Batch Size for Safe Policy Gradients (2017)
  • Compatible Reward Inverse Reinforcement Learning (2017)
  • Regret Minimization in MDPs with Options without Prior Knowledge (2017)
  • Adaptive Step-Size for Policy Gradient Methods (2013)