R has been described as 'a DSL for statistical analysis'. Hadoop is for LARGE scale computing. Between them, they can take on a number of interesting problems - once you get them to play together. Which is actually both easier and more accessible than you might think. In this demo I will solve a simple map/reduce problem in R, and run it on an Amazon EMR cluster.
YOU MAY ALSO LIKE:
- A Firehose of Functional Programming in Swift with Daniel Steinberg (in London on 21st March 2018)
- iOSCon 2018 - The conference for iOS and Swift Developers (in London on 22nd - 23rd March 2018)
- Brian Sletten's Data Science with R Workshop (in London on 2nd - 4th July 2018)
- Lightbend Apache Spark for Scala - Professional (in London on 3rd - 4th July 2018)
An example of a map/reduce algorithm using R and Hadoop
Anette is a consultant for ThoughtWorks where she builds people, teams, projects and occasionally a bit of code. She has worked in a number of different countries, industries and development stacks to solve all sorts of problems, but lately it has be