BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Talks.cam//talks.cam.ac.uk//
X-WR-CALNAME:Talks.cam
BEGIN:VEVENT
SUMMARY:A modular architecture for Unicode text compression - Adam Gleave 
 (University of Cambridge)
DTSTART:20160614T140000Z
DTEND:20160614T141500Z
UID:TALK66489@talks.cam.ac.uk
CONTACT:Adam Gleave
DESCRIPTION:Unicode is now ubiquitous\, with 87% of online content in the 
 UTF-8 character encoding. Conventional compression techniques operate on i
 ndividual bytes: this works well for ASCII\, but poorly for UTF-8\, where 
 a character can span multiple bytes. Previous attempts at Unicode compress
 ion have invented new algorithms from scratch\, with generally poor result
 s. My approach is to extend existing data compression algorithms to operat
 e over Unicode characters. I find this substantially improves compression 
 effectiveness for Unicode text\, with only a small overhead for ASCII and 
 binary files.\n\nPlease note the talk will last for 15 minutes\, although 
 I will be available afterwards for any further questions.
LOCATION: Cambridge University Engineering Department\, CBL Seminar room B
 E4-38
END:VEVENT
END:VCALENDAR
