Latent Concepts in Large Language Models
- 👤 Speaker: Prof. Pradeep Ravikumar, Carnegie Mellon University 🔗 Website
- 📅 Date & Time: Tuesday 10 June 2025, 14:00 - 15:00
- 📍 Venue: JDB Seminar Room, CUED
Abstract
Large Language Models (LLMs) have achieved remarkable fluency and versatility—but understanding how they represent meaning internally remains a challenge. In this talk, we explore the emerging science of latent concepts in LLMs: the semantic abstractions implicitly encoded in their internal activations.
We examine how concepts—such as truthfulness, formality, or sentiment—can be represented as low-dimensional structures, discovered through training dynamics, and understood through the lens of linear algebra and associative memory. We discuss the implications for interpretability, robustness, and control, including how concepts can be steered at test time to adjust model behavior without retraining. Specifically, we explore empirical and theoretical evidence supporting the linear representation hypothesis, where such concepts correspond to vectors or affine subspaces, emerging naturally from training dynamics and next-token prediction objectives. We further show that LLMs behave as associative memory systems, retrieving outputs based on latent similarity rather than logical inference. This behavior underlies phenomena such as context hijacking, where semantically misleading prompts can bias the model’s response.
We introduce formal latent concept models that unify these ideas, describe conditions under which concepts are identifiable, and propose learning algorithms for extracting interpretable, controllable representations. We argue that such latent concept modeling offers a principled framework for bridging representation learning with interpretability and model alignment, and offers a promising path toward safer, more controllable, and more trustworthy AI.
Series This talk is part of the Probabilistic Systems, Information, and Inference Group Seminars series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- Featured lists
- Information Engineering Division seminar list
- Interested Talks
- JDB Seminar Room, CUED
- ndk22's list
- ob366-ai4er
- Probabilistic Systems, Information, and Inference Group Seminars
- rp587
- School of Technology
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Prof. Pradeep Ravikumar, Carnegie Mellon University 
Tuesday 10 June 2025, 14:00-15:00