Confidence estimation for attention-based encoder-decoder models for speech recognition
- đ¤ Speaker: Qiujia Li (University of Cambridge)
- đ Date & Time: Monday 06 June 2022, 12:00 - 13:00
- đ Venue: Zoom: https://eng-cam.zoom.us/j/81927138251?pwd=TVd3MXliV003dUdYVlFwU2NDWGpmdz09
Abstract
Abstract: Confidence scores have been an intrinsic part of a conventional speech recogniser. As end-to-end ASR models such as attention-based encoder-decoder models become increasingly popular, it is of great interest to develop reliable confidence estimators for various downstream tasks. In this talk, I will present the confidence estimation module (CEM) for token/word-level confidence scores, and the residual energy-based model (R-EBM) for utterance-level confidence scores for attention-based models. Interestingly, R-EBM can also help improve the ASR performance. Furthermore, some effective techniques for generalising these model-based confidence estimators to out-of-domain data will be discussed.
Bio: Qiujia Li is a fourth-year PhD student at the University of Cambridge, advised by Prof. Phil Woodland. He obtained his BA and MEng also from Cambridge University. His research interests lie primarily in speech processing and machine learning, including end-to-end speech recognition, confidence estimation and speaker diarization. He has published more than a dozen papers at ICASSP , Interspeech, SLT , ASRU, NeurIPS and ICCV , of which two won the best student paper awards at ASRU 2019 and SLT 2021 . He previously worked as a research intern with Microsoft in 2018 and Google in 2020.
Series This talk is part of the CUED Speech Group Seminars series.
Included in Lists
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- CUED Speech Group Seminars
- Guy Emerson's list
- Information Engineering Division seminar list
- PhD related
- Zoom: https://eng-cam.zoom.us/j/81927138251?pwd=TVd3MXliV003dUdYVlFwU2NDWGpmdz09
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Monday 06 June 2022, 12:00-13:00