LLMs, Implicit Bayesian inference and compositional Generalization
- đ¤ Speaker: Szilvia Ujvary (University of Cambridge)
- đ Date & Time: Friday 20 June 2025, 12:00 - 13:00
- đ Venue: ONLINE ONLY. Here is the Zoom link: https://cam-ac-uk.zoom.us/j/4751389294?pwd=Z2ZOSDk0eG1wZldVWG1GVVhrTzFIZz09
Abstract
Abstract
Apparently rational behaviors of autoregressive LLMs, such as in-context learning, have been attributed to implicit Bayesian inference: since training data is best explained as a mixture, the optimal next-token-predictor learns to implicitly infer latent concepts and completes prompts consistently with Bayesian inference. Although it is optimal in-distribution, Bayesian inference is generally suboptimal on out-of-distribution prompts due to model misspecification. As model behavior on OOD prompts is only weakly constrained by pretraining, it is not guaranteed that Bayesian behavior is extrapolated OOD . In this talk, we investigate with small-scale experiments the degree to which Bayesian inference remains a good model of LM behavior on OOD prompts. We first review related approaches from the literature. Then, focusing on small-scale compositional tasks – learning rules of formal languages – we show that Transformers can solve harder tasks than trained on, even in settings where the Bayes posterior is undefined. We highlight the role of task compositionality as a useful inductive bias in enabling models to learn more than the training data.
Speaker Biography Szilvia is a second-year PhD student working with Professor Ferenc HuszĂĄr. Szilvia’s research focuses on explaining emergent abilities of LLMs, such as inâcontext learning and outâofâdistribution generalisation, as well as related foundational questions in (algorithmic) information theory.
Series This talk is part of the NLIP Seminar Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- ONLINE ONLY. Here is the Zoom link: https://cam-ac-uk.zoom.us/j/4751389294?pwd=Z2ZOSDk0eG1wZldVWG1GVVhrTzFIZz09
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Friday 20 June 2025, 12:00-13:00