In this talk I will describe the architecture of a typical data application built on Hadoop (an event ingest and processing pipeline), and then show how it can be built using familiar Java constructs using the Cloudera Development Kit (CDK) - an open source project with the goal of simplifying Hadoop application development.
YOU MAY ALSO LIKE:
- InfiniteConf 2017 - the conference on Big Data, Data Science and Engineering (in London on 6th - 7th July 2017)
- Brian Sletten's Data Science with R Workshop (in London on 4th - 6th September 2017)
- Lightbend's Apache Spark: An Introductory Workshop For Developers (in London on 7th - 8th September 2017)
Building Data Applications with Hadoop - Presented by Tom White
Tom White is one of the foremost experts on Hadoop. He has been an Apache Hadoop committer since February 2007, and is a Member of the Apache Software Foundation. Tom is a software engineer at Cloudera, where he has worked, since its foundation, on t