Challenges and results of large-scale mapping of contemporary English dialects using online surveys
- đ¤ Speaker: Dr Bert Vaux (Department of Linguistics, University of Cambridge)
- đ Date & Time: Thursday 02 November 2006, 17:00 - 18:30
- đ Venue: GR06-7, English Faculty, 9 West Road (Sidgwick Site)
Abstract
Linguists are just beginning to mine the web (typically via Google) for primary linguistic data (cf. Nakov and Hearst 2005, A Study of Using Search Engine Page Hits as a Proxy for n-gram Frequencies, or Nicholson and Baldwin 2006, Interpretation of Compound Nominalisations using Corpus and Web Statistics). Back in 1997, tired of saving hundreds of handwritten student surveys and having to present students with generalisations about English dialects that ceased being true 75 years ago, I decided to create an engine for collecting linguistic survey data via the web. My hope was to collect up-to-date dialect data and lots of it, in a form that could be directly dumped into a database for statistical analysis and geographic visualisation. Nine years and about eight surveys later, a host of interesting and surprising results have emerged concerning the current form of English varieties around the world. A number of unanticipated challenges have arisen as well. In this talk I present some of the most striking of these findings and problems.
Series This talk is part of the Cambridge University Linguistic Society (LingSoc) series.
Included in Lists
- All Talks (aka the CURE list)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Cambridge University Linguistic Society (LingSoc)
- Chris Davis' list
- GR06-7, English Faculty, 9 West Road (Sidgwick Site)
- Guy Emerson's list
- Language Sciences for Graduate Students
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Dr Bert Vaux (Department of Linguistics, University of Cambridge)
Thursday 02 November 2006, 17:00-18:30