Please log in to watch this conference skillscast.
How can a small team with a limited budget enable the analysis of large volumes of data in a world of constantly changing requirements?
Lindsey and Phil will share with you how the Guardian has used a range of technologies including Apache Spark and PrestoDB on AWS to support simple ingestion and fast querying of a wide range of datasets. Learn why it’s important to decouple storage from compute and raw data sources from optimised query formats and why there’s still no single perfect solution.
YOU MAY ALSO LIKE:
The Agile Data Warehouse - Beginners
Phil is a Senior Developer Manager at the Guardian.
Lindsey is a Data Engineer working at Deliveroo and is passionate about developing processes and systems to drive data-driven decisions across an organisation. She previously worked on the data technology team at the Guardian.