Skillscast coming soon.
At Last.fm, the number of "write once, run never again" Hadoop programs has been growing steadily, especially in the research team. Since Java is a very verbose and compiled programming language, it is not very suitable for writing such programs. A better way to quickly write MapReduce programs is provided by Hadoop Streaming, but it still is less convenient than it could be. Dumbo is a simple enhancement to Hadoop Streaming that addresses this issue. More specifically, it is a Python module that makes Hadoop Streaming elegant and easy.
YOU MAY ALSO LIKE:
- Russ Miles' Microservices Workshop (in London on 4th - 6th September 2017)
- Uncle Bob's Advanced TDD (in London on 30th - 31st October 2017)
- Agile Testing & BDD eXchange 2017 (in London on 9th - 10th November 2017)
- Serverless Architecture with Azure Functions with Christos Matskas! (in London on 29th November 2017)
Dumbo: Hadoop streaming made elegant and easy
Klaas Bosteels is a Hadoop expert and works at the Department of Applied Mathematics and Computer Science, Ghent University, where he is working towards a Ph.D. degree as a member of the Fuzziness and Uncertainty Modelling Research Group, in close