Rich semantic representations for detailed visual recognition
- 👤 Speaker: Subhransu Maji, Toyota Technological Institute at Chicago
- 📅 Date & Time: Wednesday 26 March 2014, 11:00 - 12:00
- 📍 Venue: Auditorium, Microsoft Research Ltd, 21 Station Road, Cambridge, CB1 2FB
Abstract
Several problems in computer vision can be cast as a mapping from input (e.g., images and video) to richly structured spaces (e.g., attributes, 3D layout, and pose). Often the choice of the underlying representation of the input is crucial to the success of automatic methods for such mappings. On one hand, representations that are semantically aligned can enable better human-centric applications, but on the other hand, representations that are not necessarily semantic when learned from `big-data’ tends to have better empirical performance.
I’ll show that with a careful design of the learning/inference method and small amounts of additional supervision, one can learn representations that achieve both the goals. Our methods leverage noisy annotations collected via “crowdsourcing” to discover semantically aligned representations that enable several high-level recognition tasks. In particular, we achieve state of the art results for person detection and attribute recognition on the PASCAL VOC datasets, and material recognition on the KTH -TIPS/Flickr datasets. I’ll also present instances where algorithms consider humans “in the loop” to solve challenging tasks, such as, fine-grained category recognition (e.g. is this bird a Quetzal?), discriminative part/attribute discovery, and to enable faster annotation interfaces.
Series This talk is part of the Microsoft Research Cambridge, public talks series.
Included in Lists
- All Talks (aka the CURE list)
- Auditorium, Microsoft Research Ltd, 21 Station Road, Cambridge, CB1 2FB
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Chris Davis' list
- Guy Emerson's list
- Interested Talks
- Microsoft Research Cambridge, public talks
- ndk22's list
- ob366-ai4er
- Optics for the Cloud
- personal list
- PMRFPS's
- rp587
- School of Technology
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Subhransu Maji, Toyota Technological Institute at Chicago
Wednesday 26 March 2014, 11:00-12:00