Value Propagation: A Graphical Model for Bayesian Reinforcement Learning
- đ¤ Speaker: Philipp Hennig (University of Cambridge)
- đ Date & Time: Monday 17 November 2008, 11:00 - 12:00
- đ Venue: TCM Seminar Room, Cavendish Laboratory, Department of Physics
Abstract
I will present Bayesian Reinforcement Learning methods based on the model-free approach used in the Temporal Difference family of algorithms. Our implementations allow for the incorporation of prior knowledge in a principled way and automatically adapt their learning rate and backup depth. In policy iteration settings, they can guide exploration and converge on the optimal policy in an automated fashion, because they track the uncertainty of evaluations.
Series This talk is part of the Machine Learning Journal Club series.
Included in Lists
- Cambridge talks
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Journal Clubs
- Inference Group Summary
- Interested Talks
- Machine Learning Journal Club
- Machine Learning Summary
- ML
- Quantum Matter Journal Club
- rp587
- TCM Seminar Room, Cavendish Laboratory, Department of Physics
- TQS Journal Clubs
- yk373's list
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Monday 17 November 2008, 11:00-12:00