Hadoop User Group UK: Dumbo: Hadoop streaming made elegant and easy
At Last.fm, the number of "write once, run never again" Hadoop
programs has been growing steadily, especially in the research team.
Because Java is a verbose, compiled language, it is poorly suited to
writing such throwaway programs. Hadoop Streaming offers a quicker way
to write MapReduce programs, but it is still less convenient than it
could be. Dumbo is a simple enhancement to Hadoop Streaming that
addresses this issue. More specifically, it is a Python module that
makes Hadoop Streaming elegant and easy.
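To give a feel for the style of program the talk is about, here is a minimal word-count sketch written as a pair of Python generator functions, one for the map step and one for the reduce step. It does not import Dumbo itself: the `run_local` driver is a hypothetical stand-in, written only for this illustration, that mimics the map, sort and reduce phases in memory so the example runs anywhere. The function names and the sample input are likewise assumptions, not taken from the talk.

```python
from itertools import groupby
from operator import itemgetter


def mapper(key, value):
    # key: position of the line in the input (unused here), value: the line text.
    for word in value.split():
        yield word, 1


def reducer(key, values):
    # values: iterator over all counts emitted for this word.
    yield key, sum(values)


def run_local(lines):
    # Hypothetical in-memory driver (not part of Dumbo or Hadoop):
    # map every line, sort by key, then reduce each group of equal keys.
    mapped = []
    for offset, line in enumerate(lines):
        mapped.extend(mapper(offset, line))
    mapped.sort(key=itemgetter(0))
    results = []
    for key, group in groupby(mapped, key=itemgetter(0)):
        results.extend(reducer(key, (count for _, count in group)))
    return results


if __name__ == "__main__":
    for word, count in run_local(["a b a", "b c"]):
        print(word, count)
```

On a real cluster the same two functions would be handed to the framework's runner instead of `run_local`, and the sort between the two phases would be carried out by Hadoop rather than in memory.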
Download the slides here
ABOUT KLAAS BOSTEELS
Klaas Bosteels is a Hadoop expert and works at the Department of
Applied Mathematics and Computer Science, Ghent University, where he
is working towards a Ph.D. degree as a member of the Fuzziness and
Uncertainty Modelling Research Group, in close
ABOUT THE HADOOP USER GROUP UK
We are the Hadoop user group for the UK, based in London. We meet monthly for talks and discussion on all topics related to Hadoop. Join if you're interested in learning what Hadoop is and how people are using it for their big data problems, and in meeting other people with experience running and coding on Hadoop clusters around London and the UK.
PODCAST DUMBO: HADOOP STREAMING MADE ELEGANT AND EASY
This session took place at the Hadoop User Group Meeting. You can view the other 12 podcasts here.