AI Alignment & RL in Turbulent Environments
- 🎤 Speaker: Leo Thom (University of Cambridge), Yixuan Zhu (Imperial College London)
- 📅 Date & Time: Wednesday 18 March 2026, 11:00 - 12:00
- 📍 Venue: MR10, Centre for Mathematical Sciences
Abstract
We have two talks for the final journal club of Lent!
1. Stress Testing Deliberative Alignment for Anti-Scheming Training – Leo Thom
Can AI models secretly pursue their own goals while appearing aligned? This paper by OpenAI and Apollo Research shows they can, demonstrating sandbagging, self-grading manipulation, and strategic deception across all major frontier models. We examine their proposed fix, its ~30× reduction in scheming, and a critical caveat about situational awareness that complicates the results. No AI safety background assumed.
2. Navigation with Reinforcement Learning in Turbulent Environments – Yixuan Zhu (Imperial)
Autonomous navigation in turbulent atmospheres presents a unique challenge, characterized by uncertain causal relationships and incomplete environmental information. In this talk, we will explore thermal soaring, the process by which birds and gliders harvest energy from ascending air currents to remain airborne without propulsion. We will examine two key studies by Reddy et al. that utilize Reinforcement Learning (RL) to address this problem. First, we will discuss how gliders can learn effective soaring strategies in turbulent flow simulations. We will then look at the transition to real-world applications, where RL algorithms trained on field data enabled model gliders to outperform non-trained counterparts by identifying and tracking thermals.
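To give a flavour of the approach ahead of the talk, here is a minimal tabular Q-learning sketch in the spirit of the soaring problem. The state and action encodings, reward, and environment dynamics are all hypothetical simplifications for illustration, not the actual setup used by Reddy et al.

```python
import random

# Toy tabular Q-learning sketch for thermal soaring (hypothetical
# encoding, not the setup from the papers discussed in the talk).
# State: sign of the recent change in vertical air velocity (-1, 0, +1).
# Action: bank left, fly straight, or bank right.
STATES = (-1, 0, 1)
ACTIONS = ("left", "straight", "right")

def q_learning(episodes=500, steps=50, alpha=0.1, gamma=0.9,
               epsilon=0.1, seed=0):
    rng = random.Random(seed)
    Q = {(s, a): 0.0 for s in STATES for a in ACTIONS}
    for _ in range(episodes):
        s = rng.choice(STATES)
        for _ in range(steps):
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda x: Q[(s, x)])
            # Toy reward: circling while the climb rate is improving
            # (s == +1) mimics "stay inside the thermal core".
            reward = 1.0 if (s == 1 and a != "straight") else 0.0
            # Turbulence: the next observation is noisy.
            s_next = rng.choice(STATES)
            # Standard Q-learning update.
            best_next = max(Q[(s_next, x)] for x in ACTIONS)
            Q[(s, a)] += alpha * (reward + gamma * best_next - Q[(s, a)])
            s = s_next
    return Q
```

After training, the greedy policy in the "climb rate increasing" state prefers a banking action over flying straight, which is the qualitative behaviour the field experiments reward. The real work replaces this toy table with states built from local mechanical cues (e.g. vertical acceleration and roll-wise torque) measured in turbulent flow.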
Series: This talk is part of the DAMTP ML for Science Reading Group series.

