BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Talks.cam//talks.cam.ac.uk//
X-WR-CALNAME:Talks.cam
BEGIN:VEVENT
SUMMARY:Stratosphere: Massively parallel dataflow programming - Kostas Tzo
 umas\, Technical University of Berlin
DTSTART:20130201T100000Z
DTEND:20130201T110000Z
UID:TALK42914@talks.cam.ac.uk
CONTACT:Microsoft Research Cambridge Talks Admins
DESCRIPTION:As a reaction to the recent "Big Data" trend\, a new breed of 
 systems for scalable data processing has emerged. Our system\, Stratospher
 e\, offers an extensible query language for posing queries on complex nest
 ed data\, an efficient processing engine designed to scale on very large c
 lusters and leverage cloud elasticity\, as well as a query optimizer and a
  runtime engine that guarantee the efficient execution of queries\, includ
 ing iterative queries. Stratosphere pushes the MapReduce paradigm forward 
 by incorporating several optimizations known from parallel databases\, as 
 well as novel techniques\, while retaining the flexibility of in-situ proc
 essing of data using complex user-defined functions.\n\nIn this talk\, I w
 ill provide an overview of the Stratosphere system\, placing emphasis on h
 ow to optimize and execute in parallel an extended dataflow programming mo
 del with user-defined functions and iterative constructs. I will then prov
 ide a research outlook for scalable data analytics that includes research 
 topics in the intersection of programming languages\, databases\, and netw
 orks.
LOCATION:Auditorium\, Microsoft Research Ltd\, 21 Station Road\, Cambridge
 \, CB1 2FB
END:VEVENT
END:VCALENDAR
