Model Merging — A Tale of Two Settings
- 👤 Speaker: Donato Crisostomi
- 📅 Date & Time: Friday 21 November 2025, 11:00 - 12:00
- 📍 Venue: GR06/07, English Faculty Building, 9 West Road, Sidgwick Site, and online at https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
Abstract: In this talk, I will introduce the emerging field of model merging: the process of combining multiple neural networks into a single model without retraining. We’ll begin with foundational concepts such as linear mode connectivity and task vectors, and explore two main settings: (1) merging models trained from scratch on the same task but with different initializations, and (2) merging models finetuned on different tasks from a shared pretrained base. I will then present a series of recent works that expand the model merging toolkit. These include the use of cycle consistency in permutation-based merging, insights into how task vectors relate to gradients, SVD-based approaches for low-rank model fusion, and the application of evolutionary algorithms to discover optimal merging coefficients. Throughout, we’ll see how these techniques can be applied in real-world scenarios, from model compression in computer vision to the synthesis of state-of-the-art LLMs for low-resource languages.
Bio: Donato Crisostomi is an ELLIS PhD student at the Sapienza University of Rome & University of Cambridge, currently interning at Cohere. His research focuses on model merging and representational alignment. He currently leads the “Model Reuse” work package for the 1.5M€ project “NEXUS”. He previously held roles as a visiting researcher at the University of Cambridge, a Research Scientist at Amazon Alexa, and an Applied Scientist at Amazon Search. His research has been featured in top-tier AI conferences and journals, including CVPR, NeurIPS, ACM, ACL, and LoG. In addition to his scientific contributions, he has played an active role in the research community as an organizer of the UniReps workshop at NeurIPS, a mentor at LOGML, and a program committee member for leading conferences such as CVPR, NeurIPS, and ICLR.
Series: This talk is part of the Language Technology Lab Seminars series.