New probabilistic methods for inference of natural selection on regulatory sequences in the human genome
- đ¤ Speaker: Adam Siepel, Cornell University
- đ Date & Time: Tuesday 19 February 2013, 14:00 - 15:00
- đ Venue: Auditorium, Microsoft Research Ltd, 21 Station Road, Cambridge, CB1 2FB
Abstract
For decades, it has been hypothesized that gene regulation has played a central role in human evolution, yet much remains unknown about the genome-wide impact of regulatory mutations. Here we use complete genome sequence data to demonstrate that natural selection has exerted a profound influence on human regulatory sequences since our divergence from chimpanzees 4-6 million years ago. Our analysis is based on a new probabilistic method for characterizing the influence of natural selection on collections of short regulatory elements scattered across the genome. Our method, called Inference of Natural Selection from Interspersed Genomically coHerent elemenTs (INSIGHT), uses a generative probabilistic model to contrast patterns of genetic variation in humans and nonhuman primates in the elements of interest with those in nearby “neutral” sites. Using a Bayesian approach, we are able to pool weak information from many short elements in a manner that accounts for variation across the genome in patterns of neutral genetic variation. The model is efficiently fitted to genome-wide data by an approximate expectation maximization algorithm. Using simulations, we show that INSIGHT can accurately estimate the evolutionary parameters of interest even in complex evolutionary scenarios. We apply it to real genomic data and find that binding sites have experienced somewhat weaker selection than protein-coding genes, on average, but that the binding sites of several transcription factors show clear evidence of adaptation. We project that regulatory elements may make larger cumulative contributions than protein-coding genes to both adaptive substitutions and deleterious polymorphisms, which has important implications for human evolution and disease.
Series This talk is part of the Microsoft Research Machine Learning and Perception Seminars series.
Included in Lists
- All Talks (aka the CURE list)
- Auditorium, Microsoft Research Ltd, 21 Station Road, Cambridge, CB1 2FB
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Chris Davis' list
- Guy Emerson's list
- Interested Talks
- Machine Learning Summary
- Microsoft Research Cambridge, public talks
- Microsoft Research Machine Learning and Perception Seminars
- ML
- ndk22's list
- ob366-ai4er
- Optics for the Cloud
- personal list
- PMRFPS's
- rp587
- School of Technology
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Adam Siepel, Cornell University
Tuesday 19 February 2013, 14:00-15:00