A Data-Free, Universal Prior Distribution for Syntactic Structures
- 👤 Speaker: Dr Fermin Moscoso del Prado Martin - Department of Computer Science and Technology, University of Cambridge
- 📅 Date & Time: Wednesday 28 February 2024, 15:05 - 15:55
- 📍 Venue: Lecture Theatre 1, Computer Laboratory, William Gates Building
Abstract
Across linguistic theories, human language structures are represented by graphs. Much research has focused on the linearisation of such graphs into actual sequences expressing utterances, but less attention has been paid to the shapes that the graphs themselves take: their topology. A current hypothesis from psycholinguistics argues that the structures in human language are primarily shaped by the nature of human language production processes. Utterances are planned in an incremental manner: successively incorporating chunks –either single words, phrases, or even full clauses– into partial syntactic structures. Incremental construction should constrain the plausible probability distributions of syntactic structures in predictable ways. I show that the topologies of actual syntactic graphs exhibit the precise deviation from randomness that incremental construction predicts. This is a previously unknown universal regularity of human languages: Syntactic structures are constrained to a predictable topological distribution –that generated by sublinear preferential attachment– constant for all 124 languages studied, across language families and modalities (spoken, written, and signed). It supports the hypothesis that syntactic structures are mainly shaped by language production. Furthermore, it demonstrates how the observed efficiency of languages might just be epiphenomenal. Crucially, this finding implicitly defines a data-free universal prior distribution for parse structures, with possible applications in language technologies.
Link to join virtually: https://cam-ac-uk.zoom.us/j/81322468305
A recording of this talk is available at the following link: https://www.cl.cam.ac.uk/seminars/wednesday/video/
Series This talk is part of the Wednesday Seminars - Department of Computer Science and Technology series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge talks
- Chris Davis' list
- computer science
- Department of Computer Science and Technology talks and seminars
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Lecture Theatre 1, Computer Laboratory, William Gates Building
- Martin's interesting talks
- School of Technology
- se393's list
- Trust & Technology Initiative - interesting events
- Wednesday Seminars - Department of Computer Science and Technology
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Dr Fermin Moscoso del Prado Martin - Department of Computer Science and Technology, University of Cambridge
Wednesday 28 February 2024, 15:05-15:55