NIPS Proceedings
β
Books
Hartmut Maennel
1 Paper
Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
(2019)