A Hierarchical Bayesian Language Model based on Pitman-Yor Processes
- đ¤ Speaker: Matt Shannon (University of Cambridge)
- đ Date & Time: Thursday 22 January 2009, 14:00 - 15:30
- đ Venue: Engineering Department, CBL Room 438
Abstract
I will be discussing:
- A Hierarchical Bayesian Language Model based on Pitman-Yor Processes, Yee Whye Teh, http://www.gatsby.ucl.ac.uk/~ywteh/research/bayesnlp/acl2006.pdf
N-gram language modelling traditionally uses some form of “smoothing” technique to allocate some probability mass to unseen N-grams. Over the years people have come up with smoothing schemes that perform pretty well, but it’s not easy to get a handle on what they’re doing, and how to improve them.
In this paper, Teh shows that a hierarchical Bayesian language model with a very simplistic model of context performs pretty much as well as the current state of the art smoothing schemes, and in fact has strong similarities to an existing smoothing scheme.
Series This talk is part of the Machine Learning Reading Group @ CUED series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- custom
- Engineering Department, CBL Room 438
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Journal Clubs
- Inference Group Summary
- Information Engineering Division seminar list
- Interested Talks
- Machine Learning Reading Group
- Machine Learning Reading Group @ CUED
- Machine Learning Summary
- ML
- ndk22's list
- ob366-ai4er
- Quantum Matter Journal Club
- Required lists for MLG
- rp587
- School of Technology
- Simon Baker's List
- TQS Journal Clubs
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Thursday 22 January 2009, 14:00-15:30