The Geometry of Machine Translation
- Speaker: Rory Waite (University of Cambridge)
- Date & Time: Friday 16 January 2015, 13:30 - 14:30
- Venue: Department of Engineering - LR6
Abstract
Most modern statistical machine translation systems are based on linear statistical models. One extremely effective method for estimating the model parameters is minimum error rate training (MERT), an efficient form of line search adapted to the highly non-linear objective functions used in machine translation. We will show that MERT can be represented using convex geometry, the mathematics of polytopes and their faces. Using this geometric representation of MERT, we investigate whether the optimisation of linear models is tractable in general. It has been believed that the number of feasible solutions of a linear model is exponential with respect to the number of sentences used for parameter estimation; however, we show that the exponential complexity is instead due to the feature dimension. This result has important ramifications because it suggests that the current trend of building statistical machine translation systems by introducing very large numbers of sparse features is inherently not robust.
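To make the line-search idea concrete, here is a minimal sketch for a single sentence. It is an illustration under stated assumptions, not the speaker's implementation. Under a linear model, the score of a hypothesis with feature vector h along the search line w + γd is (w·h) + γ(d·h), which is a straight line in γ, so the best-scoring hypothesis for every γ can be read off the upper envelope of those lines. The names `hypothesis_line` and `upper_envelope` and the toy feature vectors are hypothetical.

```python
from typing import List, Sequence, Tuple


def hypothesis_line(w: Sequence[float], d: Sequence[float],
                    h: Sequence[float]) -> Tuple[float, float]:
    """Return (slope, intercept) of the score of h along w + gamma * d."""
    slope = sum(di * hi for di, hi in zip(d, h))      # d . h
    intercept = sum(wi * hi for wi, hi in zip(w, h))  # w . h
    return slope, intercept


def upper_envelope(lines: List[Tuple[float, float, str]]) -> List[Tuple[float, str]]:
    """lines holds (slope, intercept, hyp_id) triples.

    Returns (gamma_start, hyp_id) pairs from left to right: hyp_id is the
    highest-scoring hypothesis from gamma_start up to the next entry's start.
    """
    # Sort by slope; among equal slopes only the highest intercept can win.
    dedup: List[Tuple[float, float, str]] = []
    for slope, intercept, hyp in sorted(lines, key=lambda l: (l[0], l[1])):
        if dedup and dedup[-1][0] == slope:
            dedup[-1] = (slope, intercept, hyp)
        else:
            dedup.append((slope, intercept, hyp))

    hull: List[Tuple[float, float, float, str]] = []  # (start, slope, intercept, hyp)
    for slope, intercept, hyp in dedup:
        start = float("-inf")
        while hull:
            s0, m0, b0, _ = hull[-1]
            x = (b0 - intercept) / (slope - m0)  # where the new line overtakes the top
            if x <= s0:
                hull.pop()  # the top line is never the maximum, so discard it
            else:
                start = x
                break
        hull.append((start, slope, intercept, hyp))
    return [(start, hyp) for start, _, _, hyp in hull]


# Toy example: three hypotheses with 2-dimensional feature vectors (assumed values).
w = [1.0, 0.0]  # current parameters
d = [0.0, 1.0]  # search direction
feats = {"h0": [0.0, -1.0], "h1": [1.0, 0.0], "h2": [0.0, 1.0]}
lines = [(*hypothesis_line(w, d, h), name) for name, h in feats.items()]
envelope = upper_envelope(lines)
```

In this toy case `h0` wins for γ below -1, `h1` between -1 and 1, and `h2` above 1. A full MERT step would merge such envelopes across all tuning sentences and evaluate the error metric on each resulting interval of γ; that part is omitted here.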
Biography
Rory is a research assistant and a recent graduate student from the University of Cambridge. His research is in statistical machine translation.
Series
This talk is part of the CUED Speech Group Seminars series.