AI Metrics: Theoretical Foundations, Design, and Selection of Evaluation Metrics Based on Ground Truth
- 👤 Speaker: Enrique Amigó (National University of Distance Education, Madrid, Spain) 🔗 Website
- 📅 Date & Time: Friday 20 March 2026, 12:00 - 13:00
- 📍 Venue: SS03 Hybrid (In-Person + Online). Here is the Google Meet Link: https://meet.google.com/cru-hcuo-rhu
Abstract
In this talk (based on a book draft, see this link) I propose a unified formal framework for ground truth based evaluation metrics and task characterization grounded in measurement theory. Building on this foundation, I analyze the formal properties of existing metrics and organize them into families according to task characteristics. The book covers a wide range of discriminative tasks, including classification, ranking, clustering, and sequence labelling, among others, as well as text generation. It also provides practical guidance for selecting appropriate metrics depending on the evaluation scenario, together with a unified software framework that implements metrics across multiple tasks. Finally, the book extends evaluation beyond effectiveness to additional dimensions of AI quality, such as harmfulness, bias and fairness, explainability, and the assessment of cognitive capabilities.
Bio: Enrique Amigó is an Assistant Professor at the National University of Distance Education (UNED, Spain) and a member of UNED ’s Natural Language Processing and Information Retrieval group. His main research interests include evaluation metrics, document similarity, representation, and the connections between Information Access, Measurement, Information Theory, and cognitive science. He has received more than 3,000 citations, in most cases as first author. He has also participated in numerous research projects at the regional, national, and international levels.
Series This talk is part of the NLIP Seminar Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- SS03 Hybrid (In-Person + Online). Here is the Google Meet Link: https://meet.google.com/cru-hcuo-rhu
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Enrique Amigó (National University of Distance Education, Madrid, Spain) 
Friday 20 March 2026, 12:00-13:00