The amount of biological data being generated has exploded in the past 5 years, primarily due to the decreasing costs of high throughput sequencing. In particular tens of thousands of cancer genomes have been sequenced. Interpreting this data in a meaningful way however, is challenging. We hypothesized that the patterns of genetic changes in cancers genomes that we observe are a consequence of how cancers grow. To test this we developed a stochastic simulation of tumor growth, which we could fit to cancer genomic data using the Approximate Bayesian Computation (ABC) framework. Due to Julia’s unique capabilities we implemented our stochastic simulations as well as the statistical inference in the language. Approximate Bayesian methods require potentially millions of simulations with different parameters to accurately fit to data, Julia’s speed and ease of use makes it an attractive option for such problems. Marc will also touch on why Julia holds huge promise for other “big data” problems and how he’s been using Julia as an open research tool with other projects such as Jupyter notebooks.
YOU MAY ALSO LIKE:
- Got The Perfect Agile Model? Here’s What You Should Know About Agnostic Agile (in London on 25th July 2017)
- Masterclass: Reducing Technical Debt with Michael C. Feathers (in London on 21st - 22nd August 2017)
- Agile Testing & BDD eXchange 2017 (in London on 9th - 10th November 2017)
- Serverless Architecture with Azure Functions with Christos Matskas! (in London on 29th November 2017)
Making sense of biological Big Data using Julia
Marc Williams is a 3rd year PhD student at the Barts Cancer Institute and University College London. Originally trained in physics, he now applies mathematical modelling techniques to understand cancer through the lens of evolutionary biology. His work involves mathematical and computational modelling together with Bayesian inference techniques and analysis of large genomic datasets.