Circuits and Interpretability
- 👤 Speaker: Lauro Langosco, Elre Oldewage and Juyeon Heo (University of Cambridge)
- 📅 Date & Time: Wednesday 16 February 2022, 11:00 - 12:30
- 📍 Venue: Cambridge University Engineering Department, LR3A
Abstract
In this talk we will look at methods that aim to make the internal computations of neural networks visible (‘interpretable’) to humans. This is useful for (a) making deep learning models robust, fair and safe, and (b) reaching an empirical, scientific understanding of why deep learning works. We will cover various methods from the literature, focusing in particular on the study of circuits, i.e. modular subnetworks that serve a particular function.
Recommended reading:
The Building Blocks of Interpretability (https://distill.pub/2018/building-blocks/)
Optional reading:
- Adversarial Examples Are Not Bugs, They Are Features (https://arxiv.org/abs/1905.02175)
- Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks (https://arxiv.org/abs/2010.02066)
Our reading groups are live-streamed via Zoom and recorded for our YouTube channel. The Zoom details are distributed via our weekly mailing list.
Series
This talk is part of the Machine Learning Reading Group @ CUED series.