Linear Transformers for Efficient Sequence Modeling
- Speaker: Prof Yoon Kim, MIT
- Date & Time: Thursday 23 January 2025, 15:00 - 16:00
- Venue: https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
Abstract:
Transformers are still the dominant architecture for language modeling (and generative AI more broadly). The attention mechanism in Transformers is considered core to the architecture and enables accurate sequence modeling at scale. However, attention requires explicitly modeling pairwise interactions amongst all elements of a sequence, and thus its complexity is quadratic in input length. This talk will describe some recent work from our group on efficient architectural alternatives to Transformers for language modeling, in particular linear Transformers, which can be reparameterized as an RNN and thus allow for linear-time constant-memory sequence modeling. We also provide connections between linear Transformers and recent state-space models such as Mamba.
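The RNN reparameterization mentioned in the abstract can be illustrated with a minimal sketch. It assumes the simplest variant, unnormalized causal linear attention with an identity feature map and no softmax; this is not necessarily the formulation presented in the talk. The function names and the toy shapes are hypothetical, chosen only for illustration. The first function computes all pairwise interactions (quadratic in sequence length), while the second carries a fixed-size state and processes one token at a time (linear time, constant memory in sequence length).

```python
import numpy as np

def linear_attention_parallel(Q, K, V):
    """Quadratic-time form: causally masked (Q K^T) V, without softmax."""
    T = Q.shape[0]
    scores = Q @ K.T                     # (T, T) pairwise interactions
    mask = np.tril(np.ones((T, T)))      # position t attends to s <= t only
    return (scores * mask) @ V           # (T, d_v)

def linear_attention_recurrent(Q, K, V):
    """Linear-time form: an RNN whose state is S_t = S_{t-1} + k_t v_t^T."""
    T, d_k = Q.shape
    d_v = V.shape[1]
    S = np.zeros((d_k, d_v))             # state size is independent of T
    out = np.empty((T, d_v))
    for t in range(T):
        S += np.outer(K[t], V[t])        # accumulate the key-value outer product
        out[t] = Q[t] @ S                # read out with the current query
    return out

# Toy inputs (assumed sizes): sequence length 6, key and value dimension 4.
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((6, 4)) for _ in range(3))
same = np.allclose(linear_attention_parallel(Q, K, V),
                   linear_attention_recurrent(Q, K, V))
```

Both functions compute the same output because, once the softmax is dropped, the sum over past positions of (q_t · k_s) v_s can be regrouped as q_t applied to the running sum of k_s v_s^T.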
Bio: Yoon Kim is an assistant professor at MIT (EECS/CSAIL). He obtained his PhD in computer science from Harvard University, where he was advised by Alexander Rush. Prof. Kim works on natural language processing and machine learning. Current interests include:
- Efficient training and deployment of large-scale models
- Understanding the capabilities and limitations of language models
- Symbolic mechanisms for controlling and augmenting neural networks
Series: This talk is part of the Language Technology Lab Seminars series.