BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Talks.cam//talks.cam.ac.uk//
X-WR-CALNAME:Talks.cam
BEGIN:VEVENT
SUMMARY:Conservation Evidence - Shrey Biswas / Radhika Iyer / Kacper Micha
 lik\, University of Cambridge
DTSTART:20241129T130000Z
DTEND:20241129T135500Z
UID:TALK213109@talks.cam.ac.uk
CONTACT:114742
DESCRIPTION:*Abstract*:\n\nGrey literature’s inherent nature means that 
 it is a difficult form of media to discover\, typically being hidden deep 
 within websites\, analyse\, following no standard file formats or structur
 es\, and process\, due to the sheer volume of existing and actively produc
 ed literature\, this forms a massive cost and time problem for organisatio
 ns that require such literature in their function.\n\nWe devise and implem
 ent a pipeline that uses Common Crawl internet archives to locate & scrape
  potential grey literature\; then process it for use in a multistage machi
 ne learning pipeline to classify and output relevant media. \n\n*Bios*:\n\
 n*Shrey Biswas* is a second-year Computer Science Student at Pembroke Coll
 ege.\n\n*Radhika Iyer* is a second-year Computer Science Student at Murray
  Edwards College.\n\n*Kacper Michalik* is a Second-year Computer Science S
 tudent at Pembroke College.\n
LOCATION:FW11\, William Gates Building. Zoom link: https://cl-cam-ac-uk.zo
 om.us/j/4361570789?pwd=Nkl2T3ZLaTZwRm05bzRTOUUxY3Q4QT09&amp\;from=addon 
END:VEVENT
END:VCALENDAR
