Linear Attention for Efficient Transformers
- đ¤ Speaker: Isaac Reid (University of Cambridge)
- đ Date & Time: Wednesday 30 October 2024, 11:00 - 12:30
- đ Venue: Cambridge University Engineering Department, CBL Seminar room BE4-38.
Abstract
Attention may be all you need, but that doesn’t mean it comes cheap. The Achilles’ Heel of the wildly successful Transformer architecture is its quadratic time- and space-complexity scaling with respect to the length of the input token sequence. A diverse taxonomy of methods has been proposed to remedy this bottleneck and recover linear complexity, including making attention local, sparse or low rank. We will explore the respective strengths and weaknesses of these approaches, discuss theoretical guarantees (or the lack thereof), and consider possible directions for future work.
Suggested reading:- Attention is all you need (https://arxiv.org/abs/1706.03762). Seminal Transformers paper.
- Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention (https://arxiv.org/abs/2006.16236). Among the first papers on low-rank attention.
- Swin Transformer: Hierarchical Vision Transformer using Shifted Windows (https://arxiv.org/abs/2103.14030). Popular example of local attention.
- Big Bird: Transformers for Longer Sequences (https://arxiv.org/abs/2007.14062). Example of the benefits of using a combination of techniques.
Series This talk is part of the Machine Learning Reading Group @ CUED series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Cambridge University Engineering Department, CBL Seminar room BE4-38.
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- custom
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Journal Clubs
- Inference Group Summary
- Information Engineering Division seminar list
- Interested Talks
- Machine Learning Reading Group
- Machine Learning Reading Group @ CUED
- Machine Learning Summary
- ML
- ndk22's list
- ob366-ai4er
- Quantum Matter Journal Club
- Required lists for MLG
- rp587
- School of Technology
- Simon Baker's List
- TQS Journal Clubs
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Wednesday 30 October 2024, 11:00-12:30