SkillsCast

Keynote: Spark+Hadoop and how it relates to Scala

10th December 2015 in London at Business Design Centre

There are 36 other SkillsCasts available from Scala eXchange 2015

Scala has, seemingly suddenly, become an important language within the Apache Hadoop ecosystem, with the arrival of Scala-based projects like Apache Kafka and Apache Spark, the latter being in essence "distributed Scala". In fact, it's not a surprising marriage: Hadoop has been building on functional paradigms and immutability in its own way for years through MapReduce/HDFS, and projects like Crunch and Cascading. This talk will give a Hadoop-centric take on the evolution of Scala, its benefits to Hadoop-related projects, why it succeeds in Hadoop where other languages don't, and some quirks that remain barriers to its further adoption in "big data".

About the Speaker

Sean Owen

Sean is Director of Data Science at Cloudera in London. Before Cloudera, he founded Myrrix Ltd (now the Oryx project) to commercialize large-scale real-time recommender systems on Apache Hadoop. He is an Apache Spark committer and co-author of Advanced Analytics with Spark. He was a committer and VP for Apache Mahout, and co-author of Mahout in Action. Previously, Sean was a senior engineer at Google.
