Benefits and Shortcomings of Assistance
- Speaker: Dmitrii Krasheninnikov and Lauro Langosco, University of Cambridge
- Date & Time: Wednesday 22 June 2022, 11:00 - 12:30
- Venue: Cambridge University Engineering Department, CBL Seminar room BE4-38
Abstract
Assistance games (also known as cooperative inverse RL) enable a single RL policy to both infer human preferences and act to optimize them. The idea is to model the human as part of the environment, and the true reward function as a latent variable in that environment about which the agent can make inferences. Our talk will introduce the assistance paradigm, compare it to reward learning, and discuss its flaws in the context of AI alignment.
Series
This talk is part of the Machine Learning Reading Group @ CUED series.