|
|
HADOOP USER GROUP UK : CLOUDERA'S DISTRIBUTION FOR HADOOP AND CLOUDERA ENTERPRISE AND DISTRIBUTED STREAMING LOG COLLECTION WITH FLUME
|
As the size of your log files and other dynamically-generated data
increases, they becomes more and more difficult to manage. In this
talk we'll discuss how to use Flume, an open-source framework from
Cloudera, to collect your log files as they're generated and aggregate
them to where you want to process them.
Monday 6th September 2010
Track 1
CLOUDERA'S DISTRIBUTION FOR HADOOP AND CLOUDERA ENTERPRISE
Mike Olson: Cloudera has assembled a comprehensive, fully open-source distribution of
the Apache Hadoop software and related projects. This package, Cloudera's
Distribution for Hadoop (CDH), version 3, is easy to acquire, install,
configure, run and administer, and dramatically simplifies the use and
operation of Hadoop. View the podcast here...
RELIABLE, DISTRIBUTED STREAMING LOG COLLECTION WITH FLUME
Ian Wrigley: As the size of your log files and other dynamically-generated data
increases, they becomes more and more difficult to manage. In this
talk we'll discuss how to use Flume View the podcast here...
HADOOP IN CONTEXT
Andy Kent: This talk will tell the story of our adoption of Hadoop from our initial
in-house virtualised cluster and EC2 experiment to our current dedicated
cluster, the migration from our more traditional RDBMS data warehouse to
Hive, and how we've developed tools and infrastructure to integrate Hadoop View the podcast here...
|
|
|
|
|
© Copyright 2003-2013, Skills Matter Ltd
|
|
|