Expectations vs. Reality: Lessons learned from Working on Toxic Content Detection in NLP
- đ¤ Speaker: Nedjma Ousidhoum (University of Cambridge) đ Website
- đ Date & Time: Friday 21 January 2022, 12:00 - 13:00
- đ Venue: Virtual (Zoom)
Abstract
Join Zoom Meeting https://cl-cam-ac-uk.zoom.us/j/99831805544?pwd=NUMrTGE4K2U3V2h0NlhtTHNsOG5rQT09
Meeting ID: 998 3180 5544 Passcode: 779252
In order to improve the online moderation process, there has been an increasing need for building toxic language detection tools that do not only flag bad words, but rather filter out toxic content in a more nuanced fashion. In order to train such models, it is essential to acquire data of high quality. However, in the absence of universal definitions of terms such as hate speech, and given the typical data collection process based on keywords, available corpora are usually sparse and imbalanced which makes the detection process challenging for current machine learning techniques.
In this talk, I will present my findings when working on (1) the construction of multilingual resources for robust toxic language and hate speech detection, (2) the study of bias in toxic language detection, and (3) the assessment of toxicity and harmful biases within Large Pre-trained Language Models (PTLMs) which are at the core of major NLP systems.
Series This talk is part of the NLIP Seminar Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- Virtual (Zoom)
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)



Friday 21 January 2022, 12:00-13:00