Machine learning models guide viral discovery in museum bat collections
- Speaker: Maya M. Juman, Department of Veterinary Medicine
- Date & Time: Monday 24 November 2025, 12:30 - 13:00
- Venue: SS03 Seminar Room, William Gates Building (Department of Computer Science and Technology)
Abstract
Natural history museum collections are valuable but underutilized resources for viral discovery, offering opportunities to test hypotheses about pathogen occurrence across space, time, and taxonomic groups. We developed trait-based machine learning models of bat host suitability to guide viral screening of 1821 tissues in a museum collection. Our coronavirus and paramyxovirus predictive models performed with 79% and 92% predictive accuracy, respectively, and we used these models to generate ranked lists of suspect “novel” host species for screening. For the first time, we recovered these viruses from archived museum tissues, confirming three novel coronavirus host species and three novel paramyxovirus host species (3% and 33% prediction success rate, respectively). These sequences included a SARS-like coronavirus from an Angolan bat collected in June 2019, suggesting that viruses with epidemic potential may be more widespread in sub-Saharan Africa than previously believed. This case study lays out a framework for using predictive machine learning models to unlock pathogen data hidden in historical specimens.
Series
This talk is part of the Accelerate Lunchtime Seminar Series.