Mechanistic and Attributional Interpretability in Neuroscience
- Speaker: Dr. Michail Mamalakis (University of Cambridge)
- Date & Time: Tuesday 03 March 2026, 15:00 - 16:00
- Venue: Computer Laboratory, William Gates Building, Room LT1
Abstract
In this seminar, we will briefly review basic terminology in mechanistic interpretability, examine different sparse autoencoder (SAE) architectures and techniques beyond SAEs, and explore examples of LLMs applied in neuroscience. We will also discuss how mechanistic interpretability and attribution methods can be combined to identify new patterns.
Series
This talk is part of the Foundation AI series.
Included in Lists
- All Talks (aka the CURE list)
- Artificial Intelligence Research Group Talks (Computer Laboratory)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Laboratory, William Gates Building, Room LT1
- Department of Computer Science and Technology talks and seminars
- Guy Emerson's list
- Hanchen DaDaDash
- Interested Talks
- Martin's interesting talks
- ndk22's list
- ob366-ai4er
- PhD related
- rp587
- School of Technology
- Speech Seminars
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.