|
|
Hadoop User Group UK:The Terrier Project
The Terrier Project
Terrier is a robust and modular Information Retrieval engine.
From version 2.2, Terrier supports the indexing of large collections in a Hadoop Map Reduce fashion. This uses the single-pass indexer to index sections of each collection (as batches of files) as map tasks. The output from the Map tasks take three forms: (a) terms and mini posting lists (known as runs in the single-pass indexer); (b) document indices from each map task; (c) information about the number of documents saved per run.
More information can be found here .
ABOUT IADH OUNIS
|
Iadh Ounis works as a Reader at the Department of Computing Science at the University of Glasgow and is the principal investigator of the Terrier project. The Terrier Project is doing a lot of work on Web, Blog and Enterprise search, Desktop, Intrane
More about Iadh Ounis
|
ABOUT THE HADOOP USER GROUP UK
|
We are the Hadoop users group for the UK based in London. We meet monthly for talks and discussion on all topics related to Hadoop. Join if your intrested in learning what Hadoop is, how people are using it with their big data problems, and to meet other people with experience running and coding on Hadoop cluster around London and the UK.
More about the Hadoop User Group UK
|
|
PODCAST THE TERRIER PROJECT
This session took part at the Hadoop User Group Meeting #2. You can view the other 7 podcasts here.
|
|
|