Statistical Significance Analysis of Motif Discovery
- ๐ค Speaker: Patrick Ng
- ๐ Date & Time: Friday 05 November 2010, 10:00 - 11:00
- ๐ Venue: Small public lecture room, Microsoft Research Ltd, 7 J J Thomson Avenue (Off Madingley Road), Cambridge
Abstract
The identification of transcription factor binding sites, and of cis-regulatory elements in general, is an important step in understanding the regulation of gene expression. To address this need, many motif-finding tools have been described that can find short sequence motifs given only an input set of sequences. In the first part of the talk, I will discuss why a reliable significance evaluation should be considered an essential component of any motif finder, and then I will introduce a novel biologically realistic method to estimate the reported motif’s statistical significance based on a novel 3-Gamma approximation scheme. Furthermore, I will show how its reliability can be further improved by incorporating local base composition information. Finally, I will present GIMSAN : a tool for de novo motif finding that incorporates this novel significance evaluation technique.
In the second part of my talk, I will present ALICO (Alignment Constrained) null set generator: a framework to generate randomized versions of an input multiple sequence alignment that preserve some of its crucial features including its dependence structure. In particular, I will show that, on average, ALICO samples approximately preserve the PIDs (percent identities) between every pair of input sequences as well as the average Markov model composition. I will demonstrate its utility in phylogenetic motif finders, which are finders that leverage on conservation information.
Series This talk is part of the Microsoft Research Cambridge, public talks series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Chris Davis' list
- Guy Emerson's list
- Interested Talks
- Microsoft Research Cambridge, public talks
- ndk22's list
- ob366-ai4er
- Optics for the Cloud
- personal list
- PMRFPS's
- rp587
- School of Technology
- Small public lecture room, Microsoft Research Ltd, 7 J J Thomson Avenue (Off Madingley Road), Cambridge
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Patrick Ng
Friday 05 November 2010, 10:00-11:00