Processing Geographic Language
- đ¤ Speaker: Dr. Inderjeet Mani
- đ Date & Time: Friday 22 May 2009, 12:00 - 13:00
- đ Venue: SW01, Computer Laboratory
Abstract
Humans are able to communicate geographic information in a highly concise but vague manner, posing interesting challenges for natural language understanding. In recent years, information extraction systems have been developed to ground geographical references in text in terms of geo-coordinates, with the tags produced by such systems being used by geographical search engines and mapping tools. However, without a standard for how different types of geographical entities should be tagged, such systems are impossible to reliably evaluate. I will describe an annotation scheme called SpatialML, that has been used to accurately mark up places, their geo-coordinates, and spatial relationships in a variety of text corpora. SpatialML represents spatial relationships among geographical regions in terms of the Region Connection Calculus (RCC), and it has also been mapped to the Generalized Upper Model (GUM) ontology from the University of Bremen. SpatialML is also being used in the Cross-Language Evaluation Forum (CLEF) to assess tools to analyze geographical queries posed to search engines, and it is currently being integrated with a time markup standard (TimeML). Despite these positive trends, I will argue that a far more concerted research effort is required to address thefundamental challenges of geographic language.
Dr. Inderjeet Mani is a Visiting Fellow at Cambridge. He has been a Senior Principal Scientist at MITRE (in Boston), a Research Scholar at Brandeis University, a Research Affiliate at MIT , and a (tenured) Associate Professor at Georgetown University. His research areas in natural language processing include automatic summarization, temporal and spatial information extraction, and narrative understanding. More information can be found at www.cs.brandeis.edu/~im5.
Series This talk is part of the NLIP Seminar Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- SW01, Computer Laboratory
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Dr. Inderjeet Mani
Friday 22 May 2009, 12:00-13:00