BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Talks.cam//talks.cam.ac.uk//
X-WR-CALNAME:Talks.cam
BEGIN:VEVENT
SUMMARY:Mind the Data - Noah Smith\, University of Washington
DTSTART:20230608T150000Z
DTEND:20230608T160000Z
UID:TALK202240@talks.cam.ac.uk
CONTACT:Panagiotis Fytas
DESCRIPTION:Today’s mainstream NLP research focuses on general-purpose m
 odels that are scaled up to work with extremely large datasets.  This dire
 ction has had many benefits\, evidenced by performance on research benchma
 rks and by new use cases for AI in general\, and language models specifica
 lly\, imagined by an ever wider community of stakeholders. What I believe 
 is coming next is a strong demand for customization. More people than ever
  will want to adapt language models to create new applications. To enable 
 them\, I believe we need new affordances for working with the most importa
 nt ingredient for NLP systems: the data. In this talk\, I’ll present rec
 ent work from my group showing benefits and risks of new methods for data 
 selection\, organization\, and synthesis. I’ll advocate for a future in 
 which artifacts like language models are developed to support adaptation t
 o unexpected and diverging demands of a wide population of users\, who in 
 turn should be empowered to direct models to serve their own interests.\n
LOCATION:https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBd
 XVpOXFvdz09
END:VEVENT
END:VCALENDAR
