Please log in to watch this conference skillscast.
Users of Big (and not so Big) Data roughly divide into three groups, developers like us, traditional data analysts, and a hybrid called data scientists. The analysts prefer SQL, SAS, and similar, traditional tools. The scientists (mostly statisticians, really) prefer Python and R, with Julia emerging. The Developers started with Java, but they are being seduced by Scala, because it offers ideal tools for data-centric applications.
This talk explains why data-centric applications are driving Scala adoption. Scala already provides these essential features:
- Expressive DSLs.
- The JVM.
- Actors for distributed scaling.
- Optimizations for primitives, but uniform source abstractions.
- Functional combinators.
These features combine to yield powerful, scalable tools with concise APIs:
- Scalding and Summingbird - for Hadoop and Storm
- Spark and H2O - the Next Generation...
- Spire and Algebird - Mathematics
Finally, we'll discuss what's missing and what's ahead.
YOU MAY ALSO LIKE:
- Lessons Learned from 15 Years of Scala in the Wild (SkillsCast recorded in May 2022)
- ScalaCon 2022: Opening Keynote Networking Party (in London on 4th October 2022)
- ScalaCon 2022 (Online Conference on 4th - 8th October 2022)
- Developer Productivity Engineering - Next Big Thing in Software Development (in London on 29th June 2022)
- LJC Lunchtime Lighting Talks (Online Meetup on 1st July 2022)
- A Conceptual Introduction to Nix for Haskellers (SkillsCast recorded in November 2021)
- Haskell in the Enterprise: A Report from in the Belly of the Whale (SkillsCast recorded in November 2021)
Why Scala is Taking Over the Big Data World
Dean Wampler
Product Engineering Director for Accelerated Discovery
IBM Research