
Optimal Control and Reinforcement Learning with Gaussian Process Models


If you have a question about this talk, please contact Zoubin Ghahramani.

Optimal control and reinforcement learning (RL) share the same objective: optimization of a long-term performance measure. While the system in optimal control problems is usually known, RL has a more general setup that includes possibly unknown environments. However, after learning a model, standard algorithms for optimal control can also be applied to RL.

In this talk, a generalization of dynamic programming (DP) to continuous-valued state and action spaces is given. The proposed algorithm (GPDP) combines Gaussian process (GP) models with DP and yields an approximately optimal closed-loop policy on the entire state space. We apply GPDP to the underactuated pendulum swing-up. For exactly known environments, we show that GPDP yields a close-to-optimal solution. Moreover, we show that GPDP can successfully be applied to stochastic optimal control problems.
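The core idea described above — representing the value function with a GP fitted at a finite set of support states, and performing Bellman backups against that model — can be sketched in a few lines. The following is a minimal, self-contained illustration only, not the speaker's actual algorithm: the 1-D dynamics, reward, kernel hyperparameters, and grids are all invented for the example.

```python
import numpy as np

def rbf_kernel(A, B, ell=1.0, sf2=1.0):
    """Squared-exponential covariance between two sets of points."""
    d = A[:, None, :] - B[None, :, :]
    return sf2 * np.exp(-0.5 * np.sum(d**2, axis=2) / ell**2)

class GP:
    """Bare-bones GP regressor (mean prediction only)."""
    def __init__(self, X, y, ell=1.0, sf2=1.0, noise=1e-6):
        self.X, self.ell, self.sf2 = X, ell, sf2
        K = rbf_kernel(X, X, ell, sf2) + noise * np.eye(len(X))
        self.alpha = np.linalg.solve(K, y)

    def predict(self, Xs):
        return rbf_kernel(Xs, self.X, self.ell, self.sf2) @ self.alpha

# Hypothetical deterministic 1-D dynamics and quadratic reward,
# chosen purely for illustration.
def dynamics(x, u):
    return 0.9 * x + u

def reward(x, u):
    return -(x**2) - 0.1 * u**2

X = np.linspace(-2, 2, 21)[:, None]   # support states
U = np.linspace(-1, 1, 11)            # candidate actions
gamma = 0.9                           # discount factor
V = np.zeros(len(X))                  # value function at support states

for _ in range(50):
    gp_v = GP(X, V)                   # GP generalizes V to the whole state space
    Q = np.empty((len(X), len(U)))
    for j, u in enumerate(U):
        x_next = dynamics(X, u)       # successor states (possibly off-grid)
        Q[:, j] = reward(X[:, 0], u) + gamma * gp_v.predict(x_next)
    V = Q.max(axis=1)                 # Bellman backup at the support states

policy = U[Q.argmax(axis=1)]          # greedy closed-loop policy on the grid
```

The point of the GP here is that `predict` evaluates the value function at successor states that need not lie on the support grid, which is what lets DP operate over continuous-valued states rather than a discretized one.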

This talk is part of the Machine Learning @ CUED series.


© 2006-2025 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity