Urban Dictionary Embeddings for Slang NLP Applications

Dr Barbara McGillivray (University of Cambridge and The Alan Turing Institute)
Thursday 30 January 2020, 11:00-12:00
GR04, Faculty of English, 9 West Rd (Sidgwick Site).

If you have a question about this talk, please contact Qianchu Liu.

The choice of the corpus on which word embeddings are trained can have a sizable effect on the learned representations, the types of analyses that can be performed with them, and their utility as features for machine learning models. In this talk I will present my work on the first set of word embeddings trained on the content of Urban Dictionary, a crowd-sourced dictionary for slang words and phrases. I will show that although these embeddings are trained on fewer total tokens, they perform well across a range of common word embedding evaluations, from semantic similarity to word clustering tasks. Further, for some extrinsic tasks such as sentiment analysis and sarcasm detection, which we expect to require some knowledge of colloquial language in social media data, initializing classifiers with the Urban Dictionary Embeddings resulted in improved performance compared to initializing with a range of other well-known, pre-trained embeddings that are an order of magnitude larger in size.

This talk is part of the Language Technology Lab Seminars series.
