General Reinforcement Learning
- Speaker: Jan Leike (Australian National University)
- Date & Time: Wednesday 16 December 2015, 11:00 - 12:00
- Venue: Engineering Department, CBL Room BE-438
Abstract
Reinforcement learning problems are often phrased in terms of Markov decision processes (MDPs). In this talk, we go beyond MDPs and consider reinforcement learning in environments that are non-Markovian, non-ergodic and only partially observable. Our focus will not be on practical algorithms, but rather on the fundamental underlying problems. How do we balance exploration and exploitation? How do we explore optimally? When is an agent optimal? We introduce the Bayesian agent AIXI, point out some of its problems, and discuss potential solutions.
Speaker: Jan Leike is a PhD student at the Australian National University working with Marcus Hutter.
Series
This talk is part of the Machine Learning @ CUED series.