Algorithmic Investigation of Large Biological Data sets
- đ¤ Speaker: Lee Clewley, GSK
- đ Date & Time: Tuesday 27 February 2018, 13:00 - 14:00
- đ Venue: MR3 Centre for Mathematical Sciences
Abstract
Recently the rapid growth of both internal and external data sources in conjunction with large external databases has increased the need for GSK to address the most complex problems in drug discovery. For example, the chemical database ChEMBL, coupled with various biological databases internal and external to GSK with have meant that there is presently an enormous set potential set of research avenues that will yield biologically interesting insights. Such datasets provide a rich environment for deployment of algorithms such as Tensor flow, Deepchem or Topological Data Analysis depending on the form of the data.
In this project, the student will explore and create several algorithms that will be applied to curated datasets to test a range of biological hypothesis. This project is relatively open-ended and so the student should be ready to explore and evaluate current academic work and applicable solutions. The student should be prepared to collaboratively suggest viable hypothesis based on the data at hand.
The student should also be prepared, with aid from supervisors and contacts within the company, to demonstrate their findings in the form of visualizations, code-based models, or another appropriate medium.
Series This talk is part of the Cambridge Mathematics Placements Seminars series.
Included in Lists
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Tuesday 27 February 2018, 13:00-14:00