Training for Deployment: Methods for Small and Efficient NLP
- 👤 Speaker: Alexander Rush, Cornell Tech
- 📅 Date & Time: Thursday 20 May 2021, 15:00 - 16:00
- 📍 Venue: https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
Abstract
Natural language models for translation and classification work relatively well, or at least well enough that there is demand for widespread use in real systems. Models developed for research however do not naturally translate to deployment scenarios, particularly on resource constrained devices like mobile phones. In this talk I will discuss two axes that make it difficult to deploy NLP models in practice: a) Serial generation in translation models makes them difficult to optimize, and b) Fine-tuned parameter size in classification makes models difficult to deploy to end-users. I propose two approaches that aim to circumvent these issues, and discuss some practical work on deploying large NLP models on edge devices.
Series This talk is part of the Language Technology Lab Seminars series.
Included in Lists
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Guy Emerson's list
- https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
- Interested Talks
- Language Sciences for Graduate Students
- Language Technology Lab Seminars
- ndk22's list
- ob366-ai4er
- rp587
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Thursday 20 May 2021, 15:00-16:00