Please log in to watch this conference skillscast.
Combing near real time stream processing with batch analytics is the goal for so many companies. Why? Well, rather than getting new insights the next day after a nightly batch job you can start to get them with in seconds with stream processing. The result? Fast results that are up to date but also take into account vast amounts of historical data. Typically this is two technology stacks, e.g. Storm for stream processing and Hadoop for batch analytics. Through this talk you will lean how to do it all with the same stack: Spark running on Cassandra.
This talk will explore: an overview of Cassandra and how to model time series data, hooking up Spark stream processing to do on-the-fly aggregates, and running Spark batch jobs.
Oh and did Christopher mention? It's all in Scala.
YOU MAY ALSO LIKE:
- Building Scalable, Back Pressured Services with Akka (SkillsCast recorded in December 2017)
- Lightbend Akka for Scala - Professional (in London on 11th - 12th November 2019)
- Advanced Scala with Dick Wall (in London on 9th - 11th December 2019)
- Scala eXchange London 2019 (in London on 12th - 13th December 2019)
- Scalax2gether Community Day 2019 (in London on 14th December 2019)
- Code Kata: Yilin Wei - Optics with Monocle (in London on 22nd October 2019)
- Don’t keep it to yourself - openness and honesty in the workplace (in London on 30th October 2019)
- Abstract Data Types In The Region Of Abysmal Pain, And How To Navigate Them (SkillsCast recorded in September 2019)
- The Last Frontier and Beyond (SkillsCast recorded in August 2019)
Combining batch and stream analytics with Apache Spark and Apache Cassandra
Christopher is a Senior Engineer at Lightbend. He is currently on the core Akka team responsible for developing Akka (https://akka.io/), Akka Http, Akka Streams, Reactive Kafka and Alpakka (https://github.com/akka/alpakka). He has previously built trading systems, online television platforms and worked extensively with Apache Cassandra. Likes: Scala, Java, the JVM, Akka, distributed databases, XP, TDD, Pairing. Dislikes: Untested software and code ownership.