NIPS Proceedings

Matteo Pirotta

7 Papers

  • Exploration Bonus for Regret Minimization in Discrete and Continuous Average Reward MDPs (2019)
  • Regret Bounds for Learning State Representations in Reinforcement Learning (2019)
  • Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes (2018)
  • Adaptive Batch Size for Safe Policy Gradients (2017)
  • Compatible Reward Inverse Reinforcement Learning (2017)
  • Regret Minimization in MDPs with Options without Prior Knowledge (2017)
  • Adaptive Step-Size for Policy Gradient Methods (2013)