Principles of AI-driven Neuroscience and Translational Biomedicine
- 👤 Speaker: Mate Aller
- 📅 Date & Time: Tuesday 10 March 2026, 16:00 - 17:00
- 📍 Venue: LT1
Abstract
Title: Efficiency and flexibility in an LSTM model of human spoken word recognition
Abstract: Recent advances in artificial neural networks have enabled the design of automatic speech recognition systems that identify spoken words with an accuracy approaching that of human listeners. By analysing the functional characteristics and internal representations of such systems, and comparing them to those of human listeners, we can gain novel insights into classic psycholinguistic findings and derive testable predictions for neuroimaging experiments exploring the neural computations underlying human speech perception.
Here we build on a recently published end-to-end model of human speech recognition (‘EARSHOT’). This is a recurrent (LSTM) neural network trained to map from acoustic representations of spoken words to representations of word meaning (semantics). It exhibits a human-like time course of word identification, with parallel activation (due to phonological overlap) of onset-aligned ‘cohort’ neighbours (e.g. chain/change) and reduced or delayed competition effects between rhyme neighbours (e.g. chain/gain).
We systematically characterised EARSHOT’s behaviour in recognising speech across different talkers, speaking rates, and levels of spectral detail. In addition, we analysed the model’s hidden-state dynamics to provide a mechanistic explanation for several of the observed behavioural patterns.
This talk is part of the Data Science and AI in Medicine series.