Please log in to watch this conference skillscast.
How can a small team with a limited budget enable the analysis of large volumes of data in a world of constantly changing requirements?
Lindsey and Phil will share with you how the Guardian has used a range of technologies including Apache Spark and PrestoDB on AWS to support simple ingestion and fast querying of a wide range of datasets. Learn why it’s important to decouple storage from compute and raw data sources from optimised query formats and why there’s still no single perfect solution.
YOU MAY ALSO LIKE:
- Progressive .NET 2017 (in London on 13th - 15th September 2017)
- London Unreal Engine Meetup (in London on 20th September 2017)
- Fast Track to F# with Tomas Petricek & Phil Trelford (in London on 16th - 17th October 2017)
- Test Driven Development (TDD) Workshop with Damjan Vujnovic (in London on 7th - 8th December 2017)
The Agile Data Warehouse - Beginners
Phil is a Senior Developer Manager at the Guardian.
Lindsey is a Senior Software Engineer at the Guardian. She works on the data technology team, making all data available to the Guardian easily queryable and accessible so that data driven decisions can be made at any level in the business. Prior to joining the data team she has worked on the Guardian’s commenting system and developing the editorial tools used to produce digital content.