Cooperative Inverse RL
- Speakers: Robert Pinsler; Adria Garriga Alonso
- Date & Time: Thursday 26 October 2017, 13:30 - 15:00
- Venue: Engineering Department, CBL Seminar Room 4-38
Abstract:
The value alignment problem consists in ensuring that the values of an AI system align with those of its operator. A potential solution to this problem is formalised as the Inverse Reinforcement Learning (IRL) setting. In IRL, the goal is to infer the reward function of an agent (a human) purely from observing its behaviour in the environment. In Cooperative IRL, the human and the AI system are allowed to interact, and more effective teaching strategies than passive observation emerge from this interaction. We will talk about formalising this problem and about an algorithm to approximate good teaching strategies.
Recommended reading:
- Main paper: Cooperative Inverse Reinforcement Learning: https://papers.nips.cc/paper/6420-cooperative-inverse-reinforcement-learning.pdf
- Apprenticeship Learning via Inverse Reinforcement Learning: http://ai.stanford.edu/~ang/papers/icml04-apprentice.pdf
- Maximum Entropy Inverse Reinforcement Learning: https://www.aaai.org/Papers/AAAI/2008/AAAI08-227.pdf
Series
This talk is part of the Machine Learning Reading Group @ CUED series.