University of Cambridge > Talks.cam > Foundation AI > Mechanistic and Attributional Interpretability in Neuroscience

Log in

Google

Microsoft

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Mechanistic and Attributional Interpretability in Neuroscience

Download to your calendar using vCal

Dr. Michail Mamalakis (University of Cambridge)
Tuesday 03 March 2026, 15:00-16:00
Computer Laboratory, William Gates Building, Room LT1.

If you have a question about this talk, please contact Dr. Michail Mamalakis .

In this seminar, we will briefly discuss basic terminology in mechanistic interpretability, examine different sparse auto-encoder (SAEs) architectures and techniques beyond SAEs, and explore examples of LLMs applied in neuroscience, as well as how mechanistic interpretability and attribution methods can be combined to identify new patterns.

This talk is part of the Foundation AI series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

🔐 Log In

Information on

ℹ️ Information

Mechanistic and Attributional Interpretability in Neuroscience

This talk is included in these lists:

Mechanistic and Attributional Interpretability in Neuroscience

Abstract

Included in Lists

Log in

🔐 Log In

Information on

ℹ️ Information

Mechanistic and Attributional Interpretability in Neuroscience

This talk is included in these lists:

Other lists

Other talks

Mechanistic and Attributional Interpretability in Neuroscience

Abstract

Included in Lists