How can a small team with a limited budget enable the analysis of large volumes of data in a world of constantly changing requirements?
Lindsey and Phil will share how the Guardian has used a range of technologies, including Apache Spark and PrestoDB on AWS, to support simple ingestion and fast querying of a wide range of datasets. Learn why it’s important to decouple storage from compute, and raw data sources from optimised query formats, and why there’s still no single perfect solution.
Phil is a Senior Developer Manager at the Guardian.
Lindsey is a Senior Software Engineer at the Guardian. She works on the data technology team, making all data available to the Guardian easily queryable and accessible so that data-driven decisions can be made at any level in the business. Before joining the data team, she worked on the Guardian’s commenting system and developed the editorial tools used to produce digital content.