Speech Recognition: What’s Left?
- 👤 Speaker: Dr Michael Picheny
- 📅 Date & Time: Tuesday 12 November 2019, 12:00 - 13:00
- 📍 Venue: Department of Engineering - LT1
Abstract
Recent speech recognition advances on the SWITCHBOARD corpus suggest that because of recent advances in Deep Learning, we now achieve Word Error Rates comparable to human listeners. Does this mean the speech recognition problem is solved and the community can move on to a different set of problems? In this talk, we examine speech recognition issues that still plague the community and compare and contrast them to what is known about human perception. We specifically highlight issues in accented speech, noisy/reverberant speech, speaking style, rapid adaptation to new domains, and multilingual speech recognition. We try to demonstrate that compared to human perception, there is still much room for improvement, so significant work in speech recognition research is still required from the community.
Series This talk is part of the Information Engineering Distinguished Lecture Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- CUED Speech Group Seminars
- Department of Engineering - LT1
- Featured lists
- Guy Emerson's list
- Information Engineering Distinguished Lecture Series
- Information Engineering Division seminar list
- Interested Talks
- ndk22's list
- ob366-ai4er
- PhD related
- Probabilistic Systems, Information, and Inference Group Seminars
- rp587
- School of Technology
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Tuesday 12 November 2019, 12:00-13:00