Structured Offline Reinforcement Learning via Reward Filtering and Orthogonal Q-Contrasts

CIFW02 - Causal identification and discovery

We study offline reinforcement learning under structural conditions where the dynamics may depend on many state variables, but optimal decisions depend only on a sparse, reward-relevant subset of the state. Under this “decision-theoretic sparsity,” optimal policy and value functions admit lower-dimensional structure even though full-state transition estimation can be difficult. First, we develop a reward-relevance-filtered approach for linear function approximation that modifies thresholded Lasso within least-squares policy evaluation and fitted Q-iteration to focus estimation on reward-relevant components. Second, to improve robustness, we propose a structured difference-of-Q framework via orthogonal learning: a dynamic generalization of R-learning that targets Q-function contrasts sufficient for policy optimization, accommodates black-box nuisance estimators of Q and the behavior policy, and yields robust policy-optimization guarantees under a margin condition. Together, these methods formalize and exploit reward-relevant structure to improve statistical efficiency and robustness in offline RL.
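A minimal Python sketch of the first method may help fix ideas: fitted Q-iteration in which each per-action regression is a Lasso whose small coefficients are hard-thresholded away, so that estimation concentrates on coordinates flagged as reward-relevant. The function names, the feature map phi, and the particular thresholding rule are illustrative assumptions, not the speakers' implementation.

import numpy as np
from sklearn.linear_model import Lasso

def thresholded_lasso_fqi(phi, actions, rewards, phi_next, n_actions,
                          gamma=0.99, alpha=0.1, thresh=0.05, n_iters=50):
    """Fitted Q-iteration with per-action thresholded Lasso (illustrative).

    phi, phi_next : (n, d) feature matrices for current / next states
    actions       : (n,) integer actions from the offline dataset
    rewards       : (n,) observed rewards
    """
    d = phi.shape[1]
    w = np.zeros((n_actions, d))  # linear Q(s, a) = phi(s) @ w[a]
    for _ in range(n_iters):
        # Bellman targets computed from the current Q estimate
        q_next = np.stack([phi_next @ w[a] for a in range(n_actions)], axis=1)
        y = rewards + gamma * q_next.max(axis=1)
        for a in range(n_actions):
            mask = actions == a
            if not mask.any():
                continue
            fit = Lasso(alpha=alpha, fit_intercept=False).fit(phi[mask], y[mask])
            coef = fit.coef_.copy()
            # hard-threshold small coefficients: keep only the (estimated)
            # reward-relevant coordinates in the Q-function
            coef[np.abs(coef) < thresh] = 0.0
            w[a] = coef
    return w

For the second method, the orthogonal-learning idea descends from the R-learner. A one-step (bandit) version of the Q-contrast objective, with cross-fitted nuisance predictions m_hat_s for the outcome regression E[Y | S] and e_hat_s for the behavior policy P(A=1 | S), looks roughly as follows; the talk's dynamic generalization replaces the outcome with Bellman-style targets, and all names here are assumptions.

def orthogonal_contrast_loss(tau_s, a, y, m_hat_s, e_hat_s):
    """One-step R-learner-style loss for a binary-action Q-contrast.

    tau_s   : (n,) candidate contrast tau(S) = Q(S, 1) - Q(S, 0)
    a       : (n,) binary actions taken by the behavior policy
    y       : (n,) observed outcomes / returns
    """
    resid = (y - m_hat_s) - (a - e_hat_s) * tau_s
    return np.mean(resid ** 2)

Because nuisance errors enter this residual only through a product, first-order errors in either m_hat_s or e_hat_s cancel, which is the sense in which black-box nuisance estimators can be plugged in while retaining policy-optimization guarantees.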


This talk is part of the Isaac Newton Institute Seminar Series.
