Language and Demographics on Twitter: Inferring Latent User Attributes from Streaming Communications
- 👤 Speaker: Svitlana Volkova, Johns Hopkins University
- 📅 Date & Time: Friday 12 September 2014, 12:00 - 13:00
- 📍 Venue: FW26, Computer Laboratory
Abstract
Content shared locally within a user’s social network can reveal latent attributes of a user. However, not all attributes are pronounced equally given similar amounts of content (some attributes are harder to predict). We explore various network structures on Twitter for the prediction of attributes of varying levels of difficulty (gender, age, and political beliefs), examining the impact of graph-type and amount of available content. We show that even when limited or no self-authored data is available, language from neighbor communications provide sufficient evidence for prediction. We find that a friend graph leads to highest accuracy for gender, while a follower-graph is preferred for age, and a retweet-graph is best for political belief classification.
However, the above models for social media personal analytics assume access to thousands of messages per user, even though most users author content only sporadically over time. Given this sparsity, we: (i) leverage content from the local neighborhood of a user and (ii) estimate the amount of time and tweets required for a dynamic model to predict user preferences. When updating our dynamic models over time, we find that political beliefs can be often predicted using roughly 100 tweets, depending on the context of user selection, where this could mean hours, or weeks, based on the author’s tweeting frequency.
Series This talk is part of the NLIP Seminar Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- FW26, Computer Laboratory
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Svitlana Volkova, Johns Hopkins University
Friday 12 September 2014, 12:00-13:00