University of Cambridge > Talks.cam > Probabilistic Systems, Information, and Inference Group Seminars > Latent Concepts in Large Language Models

Log in

Google

Microsoft

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Latent Concepts in Large Language Models

Download to your calendar using vCal

Prof. Pradeep Ravikumar, Carnegie Mellon University
Tuesday 10 June 2025, 14:00-15:00
JDB Seminar Room, CUED.

If you have a question about this talk, please contact Prof. Ramji Venkataramanan .

Large Language Models (LLMs) have achieved remarkable fluency and versatility—but understanding how they represent meaning internally remains a challenge. In this talk, we explore the emerging science of latent concepts in LLMs: the semantic abstractions implicitly encoded in their internal activations.

We examine how concepts—such as truthfulness, formality, or sentiment—can be represented as low-dimensional structures, discovered through training dynamics, and understood through the lens of linear algebra and associative memory. We discuss the implications for interpretability, robustness, and control, including how concepts can be steered at test time to adjust model behavior without retraining. Specifically, we explore empirical and theoretical evidence supporting the linear representation hypothesis, where such concepts correspond to vectors or affine subspaces, emerging naturally from training dynamics and next-token prediction objectives. We further show that LLMs behave as associative memory systems, retrieving outputs based on latent similarity rather than logical inference. This behavior underlies phenomena such as context hijacking, where semantically misleading prompts can bias the model’s response.

We introduce formal latent concept models that unify these ideas, describe conditions under which concepts are identifiable, and propose learning algorithms for extracting interpretable, controllable representations. We argue that such latent concept modeling offers a principled framework for bridging representation learning with interpretability and model alignment, and offers a promising path toward safer, more controllable, and more trustworthy AI.

This talk is part of the Probabilistic Systems, Information, and Inference Group Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Latent Concepts in Large Language Models

📅 Download to calendar (vCal)

👤 Speaker: Prof. Pradeep Ravikumar, Carnegie Mellon University 🔗 Website
📅 Date & Time: Tuesday 10 June 2025, 14:00 - 15:00
📍 Venue: JDB Seminar Room, CUED

Questions? Contact Prof. Ramji Venkataramanan

Abstract

Series This talk is part of the Probabilistic Systems, Information, and Inference Group Seminars series.

Included in Lists

Note: Ex-directory lists are not shown.

Log in

🔐 Log In

Information on

ℹ️ Information

Latent Concepts in Large Language Models

This talk is included in these lists:

Latent Concepts in Large Language Models

Abstract

Included in Lists

Log in

🔐 Log In

Information on

ℹ️ Information

Latent Concepts in Large Language Models

This talk is included in these lists:

Other lists

Other talks

Latent Concepts in Large Language Models

Abstract

Included in Lists