It has been said by many that 80% data science is scrubbing data. In this talk we'll cover how you can use cascalog to scrub, transform, manipulate and mangle data into the formats you need, fix things that are wrong and filter out things that are broken.
YOU MAY ALSO LIKE:
- Clojure and Incanter for the Professional Programmer (SkillsCast recorded in March 2012)
- Leonardo De Marchi's Deep Learning Fundamentals (in London on 22nd - 23rd October 2019)
- Infiniteconf 2019 - A one-day community celebration of Big Data, Machine Learning and AI (in London on 4th July 2019)
- Clojure eXchange 2019 (in London on 2nd - 3rd December 2019)
- Keynote by Emily Robinson on Creating a Strong Data Science Portfolio (in London on 24th June 2019)
- London Java Community June (in London on 25th June 2019)
- Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive Programming - Cedrick Lunven, Developer Advocate, DataStax (SkillsCast recorded in May 2019)
- Inextricably Linked: Reproducibility and Productivity in Data Science and AI - Mark Coleman, VP Marketing, dotscience & Marketing Chairperson, Cloud Native Computing Foundation (SkillsCast recorded in May 2019)
Cascalog for the 80% of Data Science
Bruce is the co-founder and active leader of several Clojure and Java related communities and contributes to many meetups, community gatherings and courses at Skills Matter.