BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Talks.cam//talks.cam.ac.uk//
X-WR-CALNAME:Talks.cam
BEGIN:VEVENT
SUMMARY:Vision-language models (VLMs) - Varun Jain (University of Cambridg
 e)
DTSTART:20250611T100000Z
DTEND:20250611T113000Z
UID:TALK233395@talks.cam.ac.uk
CONTACT:120952
DESCRIPTION:This talk will chart the evolution of vision-language models (
 VLMs) and illustrate how architectural innovations and training paradigms 
 have progressively closed the gap between visual perception and natural‐
 language understanding. I will cover models such as CLIP\, Flamingo and LL
 aVA and discuss each of their design principles\, strengths and weaknesses
 \, and comparative performance across standard benchmarks.
LOCATION:Cambridge University Engineering Department\, CBL Seminar room BE
 4-38.
END:VEVENT
END:VCALENDAR