NIPS Proceedings

Rémi Munos

20 Papers

  • Adaptive Stratified Sampling for Monte-Carlo integration of Differentiable functions (2012)
  • Bandit Algorithms boost Brain Computer Interfaces for motor-task selection of a brain-controlled button (2012)
  • Risk-Aversion in Multi-armed Bandits (2012)
  • Finite Time Analysis of Stratified Sampling for Monte Carlo (2011)
  • Optimistic Optimization of a Deterministic Function without the Knowledge of its Smoothness (2011)
  • Selecting the State-Representation in Reinforcement Learning (2011)
  • Sparse Recovery with Brownian Sensing (2011)
  • Speedy Q-Learning (2011)
  • Error Propagation for Approximate Policy and Value Iteration (2010)
  • LSTD with Random Projections (2010)
  • Scrambled Objects for Least-Squares Regression (2010)
  • Compressed Least-Squares Regression (2009)
  • Sensitivity analysis in HMMs with application to likelihood maximization (2009)
  • Algorithms for Infinitely Many-Armed Bandits (2008)
  • Online Optimization in X-Armed Bandits (2008)
  • Particle Filter-based Policy Gradient in POMDPs (2008)
  • Fitted Q-iteration in continuous action-space MDPs (2007)
  • Efficient Resources Allocation for Markov Decision Processes (2001)
  • Barycentric Interpolators for Continuous Space and Time Reinforcement Learning (1998)
  • Reinforcement Learning for Continuous Stochastic Control Problems (1997)