BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Talks.cam//talks.cam.ac.uk//
X-WR-CALNAME:Talks.cam
BEGIN:VEVENT
SUMMARY:Kafka and Samza: distributed stream processing in practice - Marti
 n Kleppmann - LinkedIn
DTSTART:20141112T140000Z
DTEND:20141112T150000Z
UID:TALK54973@talks.cam.ac.uk
CONTACT:David Greaves
DESCRIPTION:Stream processing is an old idea\, but it is currently being r
 ediscovered in industry due to pressures from increasing data volumes (thr
 oughput)\, increasingly diverse data sources (complexity) and increasing i
 mpatience (latency).\n\nApache Samza and Apache Kafka\, two open source pr
 ojects that originated at LinkedIn\, are being successfully used at scale 
 in production. Kafka is a fault-tolerant message broker\, and Samza provid
 es a scalable processing model on top of it. They have an interesting "bac
 k to basics" approach which questions many assumptions from the last few d
 ecades of data management practice.\n\nIn particular\, their design is inf
 ormed by the experience of operating large-scale systems under heavy load\
 , and the challenges that arise in a large organisation with hundreds or e
 ven thousands of software engineers. This talk will introduce the architec
 ture of Samza and Kafka\, and explain some of the reasoning behind their u
 nderlying design decisions.\n
LOCATION:Lecture Theatre 1\, Computer Laboratory
END:VEVENT
END:VCALENDAR
