Technical Reports

HPL-2009-155

Click here for full text: PDF

Using hybrid search and query for e-discovery identification

Grosvenor, Dave; Seaborne, Andy
HP Laboratories

HPL-2009-155

Keyword(s): SPARQL, e-discovery, identification, hybrid search and query

Abstract: We investigated the use of a hybrid search and query for locating enterprise data relevant to a requesting party's legal case (e-discovery identification). We extended the query capabilities of SPARQL with search capabilities to provide integrated access to structured, semi-structured and unstructured data sources. Every data source in the enterprise is potentially with in the scope of E-discovery Identification. So we use some common enterprise structured data sources that provide product and organizational information to guide the search and restrict it to a manageable scale. We use hybrid search and query to conduct a rich high-level search, which identifies the key people and products to coarsely locate relevant data-sources. Furthermore the product and organizational data sources are also used to increase recall which is a key requirement for e- discovery Identification.

16 Pages

External Posting Date: July 6, 2009 [Fulltext]. Approved for External Publication
Internal Posting Date: July 6, 2009 [Fulltext]

Back to Index