Spark Structured Streaming provides the means to express streaming computations the same way as it would be made with static data. The built-in engine is incrementally and continuously updating the final results as streaming data continues to arrive. We’ll cover how a real life implementation of Spark Structured Streaming on top of a Hadoop Cluster is helping a big online retailer to analyse clickstream data and aggregate it with customer history information.
YOU MAY ALSO LIKE:
- Masterclass: Reducing Technical Debt with Michael C. Feathers (in London on 21st - 22nd August 2017)
- Uncle Bob's Advanced TDD (in London on 30th - 31st October 2017)
- Uncle Bob's Clean Code: Agile Software Craftsmanship (in London on 1st - 3rd November 2017)
- µCon 2017: The Microservices Conference (in London on 6th - 7th November 2017)
Spark Structured Streaming in Practice
Andrei Muraru is a Solution Architect at Bigstep. He has designed and implemented complex big data projects for more than 4 years. Currently, he is focused on large-scale real-time implementations. He is helping customers begin their journey with big data workloads by providing meaningful insights on the products and services that are appropriate for their use case.