Crowdsourcing big data in English dialectology
- đ¤ Speaker: Bert Vaux (Department of Linguistics, University of Cambridge)
- đ Date & Time: Thursday 12 November 2015, 15:00 - 15:30
- đ Venue: Queen's Building, Emmanuel College
Abstract
The Harvard Dialect Survey of 2002-3 represented the first linguistic foray into large-scale crowdsourcing (60K respondents) incentivized by dynamic geospatial imaging. Working in tandem with statistics graduate student Josh Katz of North Carolina State University I expanded this in 2013 to make the New York Times dialect quiz, which deployed Josh’s brilliant tweaks of existing clustering, visualization, and prediction algorithms to attract responses to my survey questions from more than 21 million humans. Since that time I have been collaborating with forensic linguist Jack Grieve of Aston University to extract linguistically-significant patterns and trends from our megacorpus. In this talk I report on the development of the New York Times quiz and some of the leading discoveries that have emerged from it, including isogloss conspiracies and stability, the role of political and commuting zones, and multivariate non-local cultural regions.
Series This talk is part of the Cambridge Language Sciences Annual Symposium series.
Included in Lists
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge Language Sciences Annual Symposium
- Cambridge talks
- Chris Davis' list
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- ob366-ai4er
- Queen's Building, Emmanuel College
- rp587
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Thursday 12 November 2015, 15:00-15:30