Achieving Universality in Machine Translation: M4 - Massively Multilingual, Massive MT Models for the Next 1000 Languages
- 👤 Speaker: Orhan Firat, Google Research
- 📅 Date & Time: Thursday 11 March 2021, 11:00 - 12:00
- 📍 Venue: https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
Abstract
What does universality mean for machine translation? Massively multilingual models jointly trained on hundreds of languages, have been showing great success in processing different languages simultaneously in a single large model. These large multilingual models, which we call M4, are appealing for both efficiency and positive cross-lingual transfer: (1) Training and deploying a single multilingual model requires much less resources than maintaining one model for each language considered, (2) by transferring knowledge from high-resource languages, multilingual models are able to improve performance on low-resource languages. In this talk, we will be talking about our efforts on scaling machine translation models to more than 1000 languages. We will be detailing several research (and even some development) challenges that the project has tackled; multi-task learning with hundreds of tasks, learning under heavy data imbalance, understanding the learned representations, evaluation at the tail, cross-lingual down-stream transfer and many more insights will be shared.
Series This talk is part of the Language Technology Lab Seminars series.
Included in Lists
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Guy Emerson's list
- https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
- Interested Talks
- Language Sciences for Graduate Students
- Language Technology Lab Seminars
- ndk22's list
- ob366-ai4er
- rp587
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Thursday 11 March 2021, 11:00-12:00