Skillscast coming soon.
This talk describes a parallel, distributed free text index written at HP Labs Bristol called Distributed Lucene. Distributed Lucene is based on two Apache open source projects, Hadoop and Lucene, and follows a design originally proposed by Doug Cutting. It was written to gain a better understanding of the Apache Hadoop architecture, and to investigate approaches to creating large, scalable free text indexes. For more information see the accompanying HP Labs technical report.
YOU MAY ALSO LIKE:
- Typesafe's Apache Spark: An Introductory Workshop For Developers (in London on 16th - 17th September 2015)
- Conway's Law & Reverse Conway's Law - How to avoid being caught by it and how to turn it to your advantage (in London on 23rd September 2015)
- Itamar Syn-Hershko's Fast Track to ElasticSearch & ELK (in London on 19th - 21st October 2015)
- µCon 2015: The Microservices Conference (in London on 9th - 10th November 2015)
Distributed Lucene for Hadoop
Mark Butler has a varied background in computer science research, having worked on distributed systems, computational biology, software for formulating consumer products, the mobile web and the semantic web. He has a PhD in Computer Science and is