Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Analysis
- Speaker: Dr Sharu Jose, University of Birmingham
- Date & Time: Wednesday 25 October 2023, 14:00 - 15:00
- Venue: MR5, CMS Pavilion A
Abstract
Decision-making in the face of uncertainty is a practical challenge found across various areas such as control and robotics, clinical trials, communication, and ecology. An extensively studied decision-making framework is that of stochastic contextual bandits (CBs), which uses side information, termed context, for sequential decision-making. Prior research on CBs has mostly focussed on models where the contexts are well-defined. This, however, is not true in real-world applications, where the contexts are either noisy or are indicative of predictive measurements. In this talk, we focus on noisy CBs where the learner observes only a noisy, corrupted version of the true context through an unknown noise channel. We introduce a Thompson Sampling algorithm for Gaussian bandits with Gaussian context noise that can "approximate" the action policy of an oracle which has access to the predictive distribution of the true context from the observed noisy context. Using information-theoretic tools, we study the Bayesian regret of the proposed algorithm.
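The abstract does not spell out the algorithm, so the following is only a generic illustration of the ingredients it names, not the speaker's method. It assumes a simplified setting that is not stated in the abstract: the context prior and the noise covariance are known (the talk considers an unknown noise channel), rewards are linear in the true context, and the posterior mean of the context is used as the feature. All names (`denoise_context`, `GaussianThompsonSampler`, `simulate`) are hypothetical.

```python
import numpy as np


def denoise_context(c_hat, mu_c, sigma_c, sigma_n):
    """Gaussian predictive distribution of the true context c given c_hat.

    Assumes c ~ N(mu_c, sigma_c) and c_hat = c + n with n ~ N(0, sigma_n),
    both covariances known (a simplifying assumption of this sketch).
    """
    gain = sigma_c @ np.linalg.inv(sigma_c + sigma_n)
    mean = mu_c + gain @ (c_hat - mu_c)
    cov = sigma_c - gain @ sigma_c
    return mean, cov


class GaussianThompsonSampler:
    """Per-arm Bayesian linear regression with a Gaussian prior on each theta."""

    def __init__(self, n_arms, dim, prior_precision=1.0, reward_var=1.0, seed=0):
        self.rng = np.random.default_rng(seed)
        self.reward_var = reward_var
        # Posterior precision matrices and precision-weighted sums, one per arm.
        self.precision = np.stack(
            [prior_precision * np.eye(dim) for _ in range(n_arms)]
        )
        self.b = np.zeros((n_arms, dim))

    def select(self, x):
        """Sample one theta per arm from its posterior and play the best arm."""
        scores = []
        for precision, b in zip(self.precision, self.b):
            cov = np.linalg.inv(precision)
            theta = self.rng.multivariate_normal(cov @ b, cov)
            scores.append(theta @ x)
        return int(np.argmax(scores))

    def update(self, arm, x, reward):
        """Conjugate Gaussian update for the arm that was played."""
        self.precision[arm] += np.outer(x, x) / self.reward_var
        self.b[arm] += reward * x / self.reward_var


def simulate(n_rounds=200, dim=2, n_arms=3, seed=1):
    """Run the sketch on synthetic data and return the cumulative reward."""
    rng = np.random.default_rng(seed)
    theta_true = rng.normal(size=(n_arms, dim))
    mu_c, sigma_c, sigma_n = np.zeros(dim), np.eye(dim), 0.5 * np.eye(dim)
    agent = GaussianThompsonSampler(n_arms, dim, seed=seed)
    total = 0.0
    for _ in range(n_rounds):
        c = rng.multivariate_normal(mu_c, sigma_c)                  # true context
        c_hat = c + rng.multivariate_normal(np.zeros(dim), sigma_n)  # observed
        x, _ = denoise_context(c_hat, mu_c, sigma_c, sigma_n)
        arm = agent.select(x)
        reward = theta_true[arm] @ c + rng.normal()
        agent.update(arm, x, reward)
        total += reward
    return total


if __name__ == "__main__":
    print(simulate())
```

Using only the posterior mean of the context discards the predictive covariance returned by `denoise_context`; that choice keeps the sketch short and is not a claim about how the talk's algorithm treats the residual context uncertainty.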
Series
This talk is part of the Information Theory Seminar series.