Advanced artificial agents intervene in the provision of reward
- đ¤ Speaker: Michael Cohen, University of Oxford
- đ Date & Time: Wednesday 12 October 2022, 11:00 - 12:30
- đ Venue: Cambridge University Engineering Department, CBL Seminar room BE4-38
Abstract
Subject to several assumptions, advanced artificial agents are likely to intervene in the mechanism by which their feedback is provided, and extinguish life on earth. In brief, these assumptions are: 1) it identifies possible goals at least as well as a human, 2) it acts rationally under uncertainty, 3) it does not have a large inductive bias favoring the hypothesis that its goal is to influence some distant feature of the world, 4) the cost of experimenting to validate certain hypotheses is small, 5) if something isn’t theoretically impossible, it’s probably possible to arrange with a normal action space, and 6) a sufficiently advanced agent is likely to beat a suboptimal agent in a game. See the following paper for more: https://onlinelibrary.wiley.com/doi/10.1002/aaai.12064.
Speaker Bio: I’m studying a DPhil in Engineering Science with Mike Osborne at Oxford. Before that, I got a masters in computer science at the Australian National University, studying with Marcus Hutter. My research considers the expected behavior of generally intelligent artificial agents. I am interested in designing agents that we can expect to behave safely.
Zoom link will be posted later*
Series This talk is part of the Machine Learning Reading Group @ CUED series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Cambridge University Engineering Department, CBL Seminar room BE4-38
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- custom
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Journal Clubs
- Inference Group Summary
- Information Engineering Division seminar list
- Interested Talks
- Machine Learning Reading Group
- Machine Learning Reading Group @ CUED
- Machine Learning Summary
- ML
- ndk22's list
- ob366-ai4er
- Quantum Matter Journal Club
- Required lists for MLG
- rp587
- School of Technology
- Simon Baker's List
- TQS Journal Clubs
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Wednesday 12 October 2022, 11:00-12:30