Hidden Biases. Ethical Issues in NLP, and What to Do about Them
- đ¤ Speaker: Dirk Hovy, Bocconi University in Milan, Italy
- đ Date & Time: Thursday 23 January 2020, 11:00 - 12:00
- đ Venue: GR04, Faculty of English, 9 West Rd (Sidgwick Site)
Abstract
Texts reflect the authors’ demographic properties and biases, which in turn get magnified by statistical NLP models. This has unintended consequences for our analysis: if we do not pay attention to the biases contained, we can easily draw the wrong conclusions, and create disadvantages for our users.
In this talk, I will discuss several types of biases that affect NLP models, what their sources are, and potential counter measures. - bias stemming from data, i.e., selection bias (if our texts do not adequately reflect the population we want to study), label bias (if the labels we use are skewed), and semantic bias (the latent stereotypes encoded in embeddings). - biases deriving from the models themselves, i.e., their tendency to amplify any imbalances that are present in the data. - design bias, i.e., the biases arising from our (the researchers) decisions which topics to analyze, which data sets to use, and what to do with them.
For each bias, I will provide examples and discuss the possible ramifications for a wide range of applications, and who various ways to address and counteract these biases, ranging from simple labeling considerations to new types of models.
Series This talk is part of the Language Technology Lab Seminars series.
Included in Lists
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- GR04, Faculty of English, 9 West Rd (Sidgwick Site)
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- Language Technology Lab Seminars
- ndk22's list
- ob366-ai4er
- rp587
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Thursday 23 January 2020, 11:00-12:00