A Tutorial on Algorithmic Information Theory in Modern ML
- 👤 Speaker: Szilvia Ujvary, Arik Reuter, Xianda Sun (University of Cambridge)
- 📅 Date & Time: Wednesday 05 November 2025, 11:00 - 12:30
- 📍 Venue: Cambridge University Engineering Department, CBL Seminar room BE4-38.
Abstract
This tutorial explores how ideas from algorithmic information theory connect to modern machine learning through three recent papers. We begin with Solomonoff induction—the theoretically optimal but uncomputable predictor—and show how neural networks can approximate it by training on Universal Turing Machine data (Grau-Moya et al., 2024). We then establish the formal foundations by examining Kolmogorov complexity and its connections to compression and randomness in images, exploring how the Solomonoff prior helps us understand what makes images “realistic” and guides the design of better generative models and anomaly detectors (Theis, 2024). Finally, we demonstrate these principles at scale, deriving non-vacuous generalization bounds for large language models with billions of parameters through compression-based analysis using the SubLoRA technique (Lotfi et al., 2024). No prior background in algorithmic information theory required—we’ll build intuition from first principles while connecting to familiar ML concepts throughout.
Papers:- Learning Universal Predictors (Grau-Moya et al., 2024) – https://arxiv.org/abs/2401.14953
- What Makes an Image Realistic? (Theis, 2024) – https://arxiv.org/abs/2403.04493
- Non-Vacuous Generalization Bounds for Large Language Models (Lotfi et al., 2024) – https://arxiv.org/abs/2312.17173
Series This talk is part of the Machine Learning Reading Group @ CUED series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Cambridge University Engineering Department, CBL Seminar room BE4-38.
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- custom
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Journal Clubs
- Inference Group Summary
- Information Engineering Division seminar list
- Interested Talks
- Machine Learning Reading Group
- Machine Learning Reading Group @ CUED
- Machine Learning Summary
- ML
- ndk22's list
- ob366-ai4er
- Quantum Matter Journal Club
- Required lists for MLG
- rp587
- School of Technology
- Simon Baker's List
- TQS Journal Clubs
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Szilvia Ujvary, Arik Reuter, Xianda Sun (University of Cambridge)
Wednesday 05 November 2025, 11:00-12:30