Multi-view Learning of Speech Feature Spaces
- Speaker: Karen Livescu (TTI-Chicago)
- Date & Time: Tuesday 15 September 2009, 16:00 - 17:00
- Venue: LR4, Engineering Department, Baker Building
Abstract
Many learning tasks (classification, regression, clustering) can be improved when multiple views of the data are available. The "views" may be natural ones, such as audio vs. images vs. text, or more abstract ones, such as arbitrary subsets of the observation vector. Multi-view learning algorithms, such as co-training, take advantage of the relationships between the views. In this work, we explore two-view learning of feature spaces: given two views of the training data, we learn a transformation of each view that, in some sense, best predicts the other view. Importantly, we can then apply the learned transformations even when only one view (e.g. audio) is available at test time. For this talk, I will focus on work using canonical correlation analysis (CCA), in which a linear projection of each view is learned such that the two views' projections are maximally correlated. I will describe recent experiments showing improvements on clustering tasks (speaker clustering of audio and/or video, and topic clustering of Wikipedia pages) and on a speaker identification task. Time permitting, I will describe additional ongoing work in speech and language at TTI-C.
Joint work with Kamalika Chaudhuri (UCSD), Sham Kakade (TTI-C), Karthik Sridharan (TTI-C), and Mark Stoehr (U. Chicago).
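As an illustration of the CCA idea described in the abstract, the following is a minimal sketch in Python with NumPy, not the speakers' implementation. It learns one linear projection per view so that the projected views are maximally correlated, then applies the projection for a single view on its own, as would be needed when only that view is available at test time. The function names (`fit_cca`, `project`), the small ridge term `reg`, and the synthetic two-view data are all assumptions made for this example.

```python
import numpy as np


def fit_cca(X, Y, k, reg=1e-6):
    """Fit linear CCA on paired views X (n, dx) and Y (n, dy).

    Returns the per-view means, the projection matrices Wx (dx, k) and
    Wy (dy, k), and the top-k canonical correlations.
    `reg` is a small ridge term (an assumption of this sketch) that keeps
    the covariance matrices positive definite.
    """
    n = X.shape[0]
    mean_x, mean_y = X.mean(axis=0), Y.mean(axis=0)
    Xc, Yc = X - mean_x, Y - mean_y

    # Within-view and cross-view covariance estimates.
    Cxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Cyy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Cxy = Xc.T @ Yc / (n - 1)

    # Whiten each view with its Cholesky factor: Cxx = Lx Lx^T.
    Lx = np.linalg.cholesky(Cxx)
    Ly = np.linalg.cholesky(Cyy)

    # T = Lx^{-1} Cxy Ly^{-T}; its singular values are the canonical correlations.
    T = np.linalg.solve(Lx, Cxy)
    T = np.linalg.solve(Ly, T.T).T
    U, s, Vt = np.linalg.svd(T, full_matrices=False)

    # Map the singular vectors back to the original feature spaces.
    Wx = np.linalg.solve(Lx.T, U[:, :k])
    Wy = np.linalg.solve(Ly.T, Vt.T[:, :k])
    return mean_x, mean_y, Wx, Wy, s[:k]


def project(V, mean, W):
    """Apply a learned single-view projection; the other view is not needed."""
    return (V - mean) @ W


# Synthetic example: two noisy views of a shared 2-D latent signal.
rng = np.random.default_rng(0)
z = rng.normal(size=(500, 2))
X_train = z @ rng.normal(size=(2, 5)) + 0.1 * rng.normal(size=(500, 5))
Y_train = z @ rng.normal(size=(2, 4)) + 0.1 * rng.normal(size=(500, 4))

mean_x, mean_y, Wx, Wy, corrs = fit_cca(X_train, Y_train, k=2)

# At "test time" only the first view is observed.
X_test = rng.normal(size=(10, 2)) @ rng.normal(size=(2, 5))
features = project(X_test, mean_x, Wx)  # shape (10, 2)
```

In this sketch the projected features could then be passed to a clustering or classification step. Whitening via Cholesky factors followed by an SVD is one common way to solve the CCA problem; it is used here for brevity and is not taken from the talk.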
Series
This talk is part of the Machine Intelligence Laboratory Speech Seminars series.