BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Talks.cam//talks.cam.ac.uk//
X-WR-CALNAME:Talks.cam
BEGIN:VEVENT
SUMMARY:Modeling Genetic Documents Written by DNA - Naruemon Pratanwanich 
 (University of Cambridge)
DTSTART:20140512T130000Z
DTEND:20140512T140000Z
UID:TALK52336@talks.cam.ac.uk
CONTACT:Advait Sarkar
DESCRIPTION:For decades\, biologists have recorded the expression levels
  of tens of thousands of genes under many biological conditions of
  interest. However\, interpreting such a huge list of genes demands
  considerable expertise. In this lecture\, I will present how this
  information can be deciphered into a more readable and understandable
  format. In particular\,
  this data profile will be regarded as a genetic document written by
  DNA activities occurring in a cell.\n\nStarting from Latent Dirichlet
  Allocation (LDA)\, which was originally developed in the field of text
  mining to model the relations between words in documents based on an
  abstract definition of topics\, I will give you an overview of an
  inference method
  for the model parameters and explain how to apply the learned model to
  new\, unseen documents. The concept of topic representation offers a
  simpler yet more efficient way to manage a huge collection of
  documents. Next\, I will demonstrate that the underlying intuitions of
  this model can be transferred to biological data. Ending with some
  successful examples of LDA model extensions\, I will show that it is
  straightforward to redesign a generative probabilistic model when new
  assumptions are given.
LOCATION:LT2\, Computer Laboratory\, William Gates Building
END:VEVENT
END:VCALENDAR
