University of Cambridge > Talks.cam > Language Technology Lab Seminars > Rethinking Benchmarking in AI

Log in

Google

Microsoft

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Rethinking Benchmarking in AI

Download to your calendar using vCal

Douwe Kiela, Facebook AI Research
Thursday 04 March 2021, 17:00-18:00
https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09.

If you have a question about this talk, please contact Marinela Parovic .

The current benchmarking paradigm in AI has many issues: benchmarks saturate quickly, are susceptible to overfitting, contain exploitable annotator artifacts, have unclear or imperfect evaluation metrics, and do not measure what we really care about. I will talk about my work in trying to rethink the way we do benchmarking in AI, specifically in natural language processing, focusing mostly on the recently launched Dynabench platform.

This talk is part of the Language Technology Lab Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

🔐 Log In

Information on

ℹ️ Information

Rethinking Benchmarking in AI

This talk is included in these lists:

Rethinking Benchmarking in AI

Abstract

Included in Lists

Log in

🔐 Log In

Information on

ℹ️ Information

Rethinking Benchmarking in AI

This talk is included in these lists:

Other lists

Other talks

Rethinking Benchmarking in AI

Abstract

Included in Lists