Reward Modelling
- đ¤ Speaker: Usman Anwar, University of Cambridge
- đ Date & Time: Wednesday 24 May 2023, 11:00 - 12:30
- đ Venue: Cambridge University Engineering Department, CBL Seminar room BE4-38.
Abstract
Reward modelling broadly refers to the methods and practices for specifying the goals and objectives of a learning system and determining what constitutes a desirable outcome. Within reinforcement learning (RL), it refers to the process of designing and defining the rewards or reinforcement signals. In this talk, I will provide an overview of the popular methods for reward modelling, differentiating between implicit reward modelling methods such as imitation learning and cooperative inverse reinforcement learning, and explicit reward modelling methods such as inverse RL and RL from human feedback. I will further highlight various theoretical challenges in reward modelling, discuss use of reward modelling in language models such as GPT -4 and connections of reward modelling problem with AI alignment.
Series This talk is part of the Machine Learning Reading Group @ CUED series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Cambridge University Engineering Department, CBL Seminar room BE4-38.
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- custom
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Journal Clubs
- Inference Group Summary
- Information Engineering Division seminar list
- Interested Talks
- Machine Learning Reading Group
- Machine Learning Reading Group @ CUED
- Machine Learning Summary
- ML
- ndk22's list
- ob366-ai4er
- Quantum Matter Journal Club
- Required lists for MLG
- rp587
- School of Technology
- Simon Baker's List
- TQS Journal Clubs
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Wednesday 24 May 2023, 11:00-12:30