BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Talks.cam//talks.cam.ac.uk//
X-WR-CALNAME:Talks.cam
BEGIN:VEVENT
SUMMARY:Crowdsourcing data modelling - Anthony Goldbloom (Kaggle)
DTSTART:20101118T110000Z
DTEND:20101118T120000Z
UID:TALK26402@talks.cam.ac.uk
CONTACT:Zoubin Ghahramani
DESCRIPTION:"Kaggle":http://kaggle.com\, is a global platform for data pre
 diction competitions allowing researchers and companies to post their prob
 lem and have it scrutinised by the world's data scientists.\n\nBy exposing
  a problem to a wide range of analysts and techniques\, data prediction co
 mpetitions turn out to be great way to get the most out of a dataset\, giv
 en its inherent noise and richness. For example\, Kaggle has been running 
 a bioinformatics competition requiring participants to pick markers in HIV
 's genetic sequence that predict a change in viral load (a measure of the 
 severity of infection). Within a week and a half\, the best submission had
  already outdone the best methods in the scientific literature.\n\nThis ta
 lk will introduce data modelling competitions and talk about some of the s
 tatistical challenges. 
LOCATION:Engineering Department\, CBL Room 438
END:VEVENT
END:VCALENDAR
