Spark Structured Streaming provides the means to express streaming computations the same way as it would be made with static data. The built-in engine is incrementally and continuously updating the final results as streaming data continues to arrive. We’ll cover how a real life implementation of Spark Structured Streaming on top of a Hadoop Cluster is helping a big online retailer to analyse clickstream data and aggregate it with customer history information.
YOU MAY ALSO LIKE:
Spark Structured Streaming in Practice
Andrei Muraru is a Solution Architect at Bigstep. He has designed and implemented complex big data projects for more than 4 years. Currently, he is focused on large-scale real-time implementations. He is helping customers begin their journey with big data workloads by providing meaningful insights on the products and services that are appropriate for their use case.