BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Talks.cam//talks.cam.ac.uk//
X-WR-CALNAME:Talks.cam
BEGIN:VEVENT
SUMMARY:A semantics knowledge commons for climate change - Peter Murray-Ru
 st\, Reader Emeritus in Molecular Informatics\, Yusuf Hamied Department of
  Chemistry
DTSTART:20221019T130000Z
DTEND:20221019T140000Z
UID:TALK182303@talks.cam.ac.uk
CONTACT:Samantha Noel
DESCRIPTION:Bioscience is fortunate in that the community has created a ve
 ry large frictionless semantic knowledge commons for the data it creates a
 nd uses. \n* knowledge: the information is organized systematically \n* se
 mantic: machines can "understand" the knowledge\, because it contains inst
 ructions and/or the toolchain is universal.\n* frictionless data: t
 he data can be immediately unpacked without logins or explicit permissions
 \n* commons: everyone can take part in the knowledge regardless of country
 \, experience\, or age.\n\nMost other subjects have highly heterogeneous
  data without semantics\, and this holds back the creation of knowledge.
  There is
  a pressing need to make knowledge about climate available to mitigate the
  effects of gaseous emissions. The most important resource is the UN's IPC
 C reports\, published about every five years. In 2021-2022 AR6\, with 10\,
 000 pages\, was released. #semanticClimate is a group of young Indian scie
 nce students who are developing tools and community protocols to make IPCC
 .AR6 semantic.\n\nOur first step is to convert PDF to structured HTML (a m
 essy business) and then to use a variety of text-mining tools to create vo
 cabularies. These are turned into a distributed ontology based on equivale
 nces with Wikidata items. Wikidata has 100 million items and maps onto mos
 t important metadata bases\, e.g. genes\, species\, chemicals and other in
 frastructure such as countries\, states\, protocols\, organizations\, rese
 arch establishments\, etc. This effectively creates a knowledge graph for 
 the reports\, mapped onto the public Linked Open Data cloud. \n\nThe syste
 m can be used for any set of documents\, such as a corpus for a literature
  report.\nAll tools and data are open and participants can use the systems
  locally or in Google Colab.\n\nRef: https://www.eventbrite.co.uk/e/the-cl
 imate-knowledge-hunt-hackathon-tickets-414825362827 (run on 2022-09-24)\nD
 r Gitanjali was a Cambridge-India Lecturer for 5 years \n
LOCATION:CMS\, Meeting Room 15
END:VEVENT
END:VCALENDAR
