Computer-generated Cryptic Crossword Clues: exploring creative NLG
- 👤 Speaker: David Hardcastle, The Open University
- 📅 Date & Time: Friday 07 March 2008, 12:00 - 13:00
- 📍 Venue: SW01 Computer Laboratory
Abstract
In this seminar I present some of the threads of research from my PhD, in which I constructed a system called enigma which writes cryptic crossword clues automatically.
Cryptic crossword clues are highly unusual texts: they have two independent layers of meaning, each of which has a completely separate syntax and semantics. The starting goal for enigma is a representation of the puzzle that will be hidden in the clue. This puzzle can be rendered in very many different ways according to the conventions of cryptic crossword writing, but only a handful of these renderings will also appear to be pieces of English prose, a fundamental requirement for any clue.
Following the approach taken by human compilers described in the expert literature on cryptic crossword compilation, and my own introspection, enigma explores the search space bottom up using natural language semantic and selectional constraints as a pruning heuristic.
The implementation takes the form of a chunk-based generator in which the grammar rules are encoded as syntactic constraints on the attachments which are allowed to form between chunks. Each attachment is also underwritten by semantic selectional constraints based on data sources mined from corpora and related online resources using statistical techniques.
Enigma was evaluated with a Turing-style test in which subjects were presented with computer-generated clues and clues from national newspapers which had the same answer and had to decide which was which. On average subjects identified the human-authored clues 75% of the time.
Series This talk is part of the NLIP Seminar Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- SW01 Computer Laboratory
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

David Hardcastle, The Open University
Friday 07 March 2008, 12:00-13:00