Meet up

Introduction to Sqoop

Thursday, 3rd June at Skills Matter, London

This meetup was organised by HUGUK (Hadoop User Group UK) in June 2010.

After a hiatus, the Hadoop User Group UK is meeting at Skills Matter again. Aaron Kimball from Cloudera will give an introduction to Sqoop, the open source SQL-to-Hadoop tool. Tim Sell from Last.fm will talk about using Hive in practice.

Introduction to Sqoop

This talk introduces Sqoop, the open source SQL-to-Hadoop tool. Sqoop helps users perform efficient imports of data from RDBMS sources to Hadoop's distributed file system, where it can be processed in concert with other data sources. Sqoop also allows users to export Hadoop-generated results back to an RDBMS for use with other data pipelines.

Aaron Kimball

Aaron Kimball has been working with Hadoop since early 2007. He has also worked as an independent consultant focusing on Hadoop and Amazon EC2-based systems.

Hive at Last.fm

This talk is about using Hive in practice. We will go through some of the specific use cases Hive currently serves at Last.fm, highlighting its strengths and weaknesses along the way.

Tim Sell

Tim Sell is a software developer at Last.fm, and a curious observer of the HBase subproject of Hadoop.

