Distributed Lucene: A distributed free text index for Hadoop
  Butler, Mark H.; Rutherford, James
 HP Laboratories
 
  HPL-2008-64
 Keyword(s): distributed, high availability, free text, parallel, search
 Abstract: This technical report describes a parallel, distributed free text index written at HP Labs called Distributed Lucene. Distributed Lucene is based on two Apache open source projects, Lucene and Hadoop. It was written to gain a better understanding of the Apache Hadoop architecture, which is derived from work at Google on creating large, distributed, high availability systems from commodity components.
    12 Pages
   
   External Posting Date: June 7, 2008. Approved for External Publication
   Internal Posting Date: June 7, 2008