Learning Conditional Random Fields with Hierarchical Features: Application to the Game of Go
- đ¤ Speaker: Scott Sanner, University of Toronto
- đ Date & Time: Wednesday 29 November 2006, 15:00 - 16:00
- đ Venue: Small public lecture room, Microsoft Research Ltd, 7 J J Thomson Avenue (Off Madingley Road), Cambridge
Abstract
We examine an important subtask of policy learning in the game of Go: approximating the value function given a fixed policy. We model the value function as the expected territory outcome of a Go board configuration and learn to predict this outcome using a conditional Markov random field (CRF). This task is complicated by the complexity of inference on a Go Board (361 individual territories to predict â all influenced by surrounding positions) and the use of 4 million pattern-based features. Such complexity induces many computational and statistical problems which must be accounted for during both training and inference. In this work we examine a variety of models (Independent vs. Coupled, Flat vs. Hierarchical), learning algorithms (Local Training vs. Max Likelihood vs. Max Pseudo-likelihood), and inference approaches (Loopy BP vs. Sampling, Bayesian Model Averaging vs. Heuristic Model Selection). We present results from learning to predict territory in expert games and conclude with a prescription for future work on approximating the value function in Go. This is joint work with Thore Graepel and Ralf Herbrich with contributions by Tom Minka.
Series This talk is part of the Microsoft Research Machine Learning and Perception Seminars series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Chris Davis' list
- Guy Emerson's list
- Interested Talks
- Machine Learning Summary
- Microsoft Research Cambridge, public talks
- Microsoft Research Machine Learning and Perception Seminars
- ML
- ndk22's list
- ob366-ai4er
- Optics for the Cloud
- personal list
- PMRFPS's
- rp587
- School of Technology
- Small public lecture room, Microsoft Research Ltd, 7 J J Thomson Avenue (Off Madingley Road), Cambridge
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Scott Sanner, University of Toronto
Wednesday 29 November 2006, 15:00-16:00