University of Cambridge > Talks.cam > Language Technology Lab Seminars > You Know It or You Don’t: Categorical Differences in Language Model Behavior

Log in

Google

Microsoft

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

You Know It or You Don’t: Categorical Differences in Language Model Behavior

Download to your calendar using vCal

Naomi Saphra (Harvard University & Boston University)
Thursday 12 February 2026, 16:30-17:30
https://cam-ac-uk.zoom.us/j/86890624365?pwd=oYGWpY7d5r3JOaUCaJXTD0sRECFxab.1.

If you have a question about this talk, please contact Lucas Resck .

Abstract: While years of scientific research on model training and scaling assume that learning is a gradual and continuous process, breakthroughs on specific capabilities have drawn wide attention. Why are breakthroughs so exciting? Because humans don’t naturally think in continuous gradients, but in discrete conceptual categories. If artificial language models naturally learn discrete conceptual categories, perhaps model understanding is within our grasp. I will describe what we know of categorical learning in language models, and how discrete concepts are identifiable through empirical training dynamics and through random variation between training runs. These concepts involve syntax learning, weight mechanisms, and interpretable patterns—-all of which can predict model behavior. By leveraging categorical learning, we can ultimately understand a model’s natural conceptual structure and evaluate our understanding through testable predictions.

Bio: Naomi Saphra is a research fellow at the Kempner Institute at Harvard University and incoming faculty at Boston University in 2026. Naomi is interested in empirically understanding training in NLP and language models: how models learn to encode linguistic patterns or other structure and how we can encode useful inductive biases into the training process. Recently, she has begun collaborating with natural and social scientists to use interpretability to understand the world around us. She is particularly interested in fish. Previously, she earned a PhD from the University of Edinburgh on Training Dynamics of Neural Language Models; worked at NYU , Google, MosaicML, and Facebook; and attended Johns Hopkins and Carnegie Mellon University. Outside of research, she plays roller derby under the name Gaussian Retribution and performs standup comedy.

This talk is part of the Language Technology Lab Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

You Know It or You Don’t: Categorical Differences in Language Model Behavior

📅 Download to calendar (vCal)

👤 Speaker: Naomi Saphra (Harvard University & Boston University)
📅 Date & Time: Thursday 12 February 2026, 16:30 - 17:30
📍 Venue: https://cam-ac-uk.zoom.us/j/86890624365?pwd=oYGWpY7d5r3JOaUCaJXTD0sRECFxab.1

Questions? Contact Lucas Resck

Abstract

Series This talk is part of the Language Technology Lab Seminars series.

Included in Lists

Note: Ex-directory lists are not shown.

Log in

🔐 Log In

Information on

ℹ️ Information

You Know It or You Don’t: Categorical Differences in Language Model Behavior

This talk is included in these lists:

You Know It or You Don’t: Categorical Differences in Language Model Behavior

Abstract

Included in Lists

Log in

🔐 Log In

Information on

ℹ️ Information

You Know It or You Don’t: Categorical Differences in Language Model Behavior

This talk is included in these lists:

Other lists

Other talks

You Know It or You Don’t: Categorical Differences in Language Model Behavior

Abstract

Included in Lists