Data is a powerful tool. Data-driven systems that leverage modern analytical and predictive techniques can offer significant improvements over static or heuristic-driven systems.
The questions are:
- How much can you trust your data? Data collection, processing and aggregation are challenging tasks.
- How do we build confidence in our data?
- Where did the data come from, and how was it generated?
- What checks have been, or should be, applied?
- What is affected when it all goes wrong?
This talk looks at the mechanics of maintaining data quality at scale. It first looks at bad data: what it is and where it comes from. It then dives into the techniques required to detect, avoid and ultimately deal with bad data. By the end of the talk, the audience should come away with an idea of how to design quality data-driven systems that build confidence and trust rather than inflate expectations.
Lake, Swamp or Puddle: Data Quality at Scale
Mark Hibberd
CTO, Kinesis