Concept Embedding Models: Beyond the Accuracy-Explainability Trade-off
- đ¤ Speaker: Mateo Espinosa Zarlenga (University of Cambridge) đ Website
- đ Date & Time: Tuesday 24 January 2023, 13:00 - 14:00
- đ Venue: Lecture Theatre 2, Computer Laboratory, William Gates Building and Zoom
Abstract
Join us in Lecture Theatre 2 or on Zoom
Deploying AI-powered systems requires trustworthy models supporting effective human interactions, going beyond raw prediction accuracy. Concept bottleneck models promote trustworthiness by conditioning classification tasks on an intermediate level of human-like concepts. This enables human interventions which can correct mispredicted concepts to improve the model’s performance. However, existing concept bottleneck models are unable to find optimal compromises between high task accuracy, robust concept-based explanations, and effective interventions on concepts—particularly in real-world conditions where complete and accurate concept annotations are scarce. In this talk I will describe Concept Embedding Models, a novel family of concept bottleneck models which goes beyond the current accuracy-vs-interpretability trade-off by learning interpretable high-dimensional concept representations. Our experiments demonstrate that Concept Embedding Models (a) attain better or competitive task accuracy w.r.t. standard neural models without concepts, (b) provide concept representations capturing meaningful semantics including and beyond their ground truth labels, (c) support test-time concept interventions whose effect in test accuracy surpasses that in standard concept bottleneck models, and (d) scale to real-world conditions where complete concept supervisions are scarce.
Series This talk is part of the Artificial Intelligence Research Group Talks (Computer Laboratory) series.
Included in Lists
- All Talks (aka the CURE list)
- Artificial Intelligence Research Group Talks (Computer Laboratory)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Department of Computer Science and Technology talks and seminars
- Guy Emerson's list
- Hanchen DaDaDash
- Interested Talks
- Lecture Theatre 2, Computer Laboratory, William Gates Building and Zoom
- Martin's interesting talks
- ndk22's list
- ob366-ai4er
- PhD related
- rp587
- School of Technology
- Speech Seminars
- Trust & Technology Initiative - interesting events
- Trustworthy and Responsible Machine Learning / AI
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)



Tuesday 24 January 2023, 13:00-14:00