Please log in to watch this conference skillscast.
HealthUnlocked is a social network centred around health issues, where people find information about chronic conditions. Our users share 4.5 pieces of health content every minute, which we classify into 700 different categories within milliseconds using machine learning.
The data science team at HealthUnlocked is used to using mature Python libraries to process text and implement machine learning algorithms. During this talk, you will explore a journey to translate a Python model prototype into Clojure production code.
You will learn how the HU team implemented our natural language processing pipeline, including tokenisation and vectorisation, as well as the core Naive Bayes algorithm, from first principles.
YOU MAY ALSO LIKE:
- Advanced Clojure (in London on 12th - 13th March 2018)
- Fast Track to Machine Learning with Louis Dorard (in London on 21st - 23rd March 2018)
- Brian Sletten's Data Science with R Workshop (in London on 26th - 28th March 2018)
- Infiniteconf 2018 - The conference on Big Data and Fast Data (in London on 5th - 6th July 2018)
Clojure for Data Science: from a Prototype in Python to Clojure in Production
Chloe is currently part of the data science team at HealthUnlocked which aims to improve user experience on the platform and facilitate user data analysis. She has always been passionate about healthcare and psychology and she enjoys digging for new insights in medical data.
Maria is the lead data scientist at HealthUnlocked, a health social network. Her main role is to build the data pipelines and machine algorithms that power THE team's content recommender and intelligent content tagger. Before working at HealthUnlocked, she did a PhD in Cambridge in signal processing & machine learning. She then moved to work at a startup using mainly big data tools (Spark) and Python.