The Agile Data Warehouse - Beginners

6th July 2017 in London at CodeNode

There are 43 other SkillsCasts available from Infiniteconf 2017 - the conference on Big Data and Fast Data

Please log in to watch this conference skillscast.

644095807 640

How can a small team with a limited budget enable the analysis of large volumes of data in a world of constantly changing requirements?

Lindsey and Phil will share with you how the Guardian has used a range of technologies including Apache Spark and PrestoDB on AWS to support simple ingestion and fast querying of a wide range of datasets. Learn why it’s important to decouple storage from compute and raw data sources from optimised query formats and why there’s still no single perfect solution.


Thanks to our sponsors

The Agile Data Warehouse - Beginners

Philip Wills

Phil is a Senior Developer Manager at the Guardian.

Lindsey Dew

Lindsey is a Senior Software Engineer at the Guardian. She works on the data technology team, making all data available to the Guardian easily queryable and accessible so that data driven decisions can be made at any level in the business. Prior to joining the data team she has worked on the Guardian’s commenting system and developing the editorial tools used to produce digital content.