‘Profit factory’ and ‘bathroom break’: How to analyse compounds and how to predict their emergence
- 👤 Speaker: Lonneke van der Plas (University of Malta)
- 📅 Date & Time: Wednesday 18 September 2019, 12:00 - 13:00
- 📍 Venue: FW26, Computer Laboratory
Abstract
Compounding can be defined as the formation of a new lexeme by adjoining two or more lexemes (Bauer, 2003:40). Compounds are studied extensively in the linguistic literature and are receiving increasing attention in the field of Natural Language Processing (NLP). Compounding is a very productive word formation process: English-speaking children can create novel compounds in spontaneous speech from a very young age (Clark, 1981). As a consequence, compounds are a very common word type, but many occur with a very low token count. This high productivity makes compositional approaches to automatic processing indispensable. It also raises questions about the processes that underlie the generation of novel compounds.
I will give an overview of recent work we undertook that harvests parallel corpora as indirect supervision for two tasks: compound identification, and bracketing of compounds. I will then discuss the potential of compounds as vehicles for creative thought and present some experiments that aim to predict novel compounds.
References
Bauer, L. (2003). Introducing Linguistic Morphology (2nd edn.). Washington, DC: Georgetown University Press.
Clark, E. V. (1981). Lexical innovations: How children learn to create new words. In W. Deutsch (Ed.), The child’s construction of language. London: Academic Press.
Series
This talk is part of the NLIP Seminar Series.