Discriminative Methods with Structure
- Speaker: Simon Lacoste-Julien (University of California at Berkeley)
- Date & Time: Tuesday 11 March 2008, 11:30 - 12:30
- Venue: Engineering Department, CBL Room 438
Abstract
Real-world problems such as machine translation involve complex dependencies. Generative models provide an elegant and flexible framework for modeling those dependencies, but they appear to be less robust to model misspecification than discriminative models for classification. In this talk, we present methods for leveraging the advantages of generative models in the discriminative framework.
In the first part of the talk, we tackle the word alignment problem from natural language processing. We formulate it as a weighted bipartite matching problem and show how to learn the weights using a large-margin approach for structured prediction. By providing a flexible discriminative modeling framework, we were able to cut the Alignment Error Rate in half compared to the previous best-performing generative models for word alignment.
In the second part of the talk, we study probabilistic topic models, which have been popular for modeling latent structure in text documents (as bags of words) or images (as bags of visual words). They are usually trained as generative models with maximum likelihood estimation, though this can be suboptimal if one is interested in classification. In contrast, we present a discriminative version of the Latent Dirichlet Allocation (LDA) model, which attempts to uncover the latent structure in the documents while optimizing its predictive power for classification. We present results in the domains of document classification and scene categorization.
(joint work with Fei Sha, Ben Taskar, Dan Klein and Michael I. Jordan)
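As a rough illustration of the weighted bipartite matching view of word alignment mentioned in the first part of the abstract, the sketch below picks the one-to-one alignment with the highest total score. It is a toy under stated assumptions, not the speaker's method: the sentence pair, the score values, and the `best_alignment` helper are all hypothetical, the scores are hand-picked rather than learned with a large-margin procedure, and the search is brute force over permutations rather than an efficient matching algorithm.

```python
from itertools import permutations


def best_alignment(scores):
    """Return (total, pairs) for the highest-scoring one-to-one matching.

    scores[i][j] is the (hypothetical) score for aligning source word i
    with target word j. Brute force: only suitable for tiny examples.
    """
    n_src = len(scores)
    n_tgt = len(scores[0])
    if n_src > n_tgt:
        raise ValueError("this sketch assumes no more source than target words")

    best_total, best_perm = float("-inf"), None
    # Each permutation assigns a distinct target index to every source index.
    for perm in permutations(range(n_tgt), n_src):
        total = sum(scores[i][j] for i, j in enumerate(perm))
        if total > best_total:
            best_total, best_perm = total, perm
    return best_total, list(enumerate(best_perm))


# Hypothetical sentence pair and hand-picked edge scores (not learned).
source = ["le", "chat", "noir"]
target = ["the", "black", "cat"]
scores = [
    [2.0, 0.1, 0.3],  # le   vs. the / black / cat
    [0.2, 0.4, 1.8],  # chat vs. the / black / cat
    [0.1, 1.5, 0.2],  # noir vs. the / black / cat
]

total, alignment = best_alignment(scores)
aligned_pairs = [(source[i], target[j]) for i, j in alignment]
print(aligned_pairs)
```

In a learned system, each score would typically be computed from features of the word pair and a weight vector, and the learning step would adjust those weights so that reference alignments outscore the alternatives.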
Series
This talk is part of the Machine Learning @ CUED series.