Distributed Lucene: A distributed free text index for Hadoop
Butler, Mark H.; Rutherford, James
HP Laboratories
HPL-2008-64
Keyword(s): distributed, high availability, free text, parallel, search
Abstract: This technical report describes Distributed Lucene, a parallel, distributed free text index written at HP Labs. Distributed Lucene is based on two Apache open source projects, Lucene and Hadoop. It was written to gain a better understanding of the Apache Hadoop architecture, which is derived from work at Google on creating large, distributed, high-availability systems from commodity components.
12 Pages
External Posting Date: June 7, 2008. Approved for External Publication
Internal Posting Date: June 7, 2008