Technical Reports

HPL-2009-39

Click here for full text: PDF

Document summarization using Wikipedia

Ramanathan, Krishnan; Sankarasubramaniam, Yogesh; Mathur, Nidhi; Gupta, Ajay
HP Laboratories

HPL-2009-39

Keyword(s): Single Document Summarization, Wikipedia, ROUGE

Abstract: Although most of the developing world is likely to first access the Internet through mobile phones, mobile devices are constrained by screen space, bandwidth and limited attention span. Single document summarization techniques have the potential to simplify information consumption on mobile phones by presenting only the most relevant information contained in the document. In this paper we present a language independent single-document summarization method. We map document sentences to semantic concepts in Wikipedia and select sentences for the summary based on the frequency of the mapped-to concepts. Our evaluation on English documents using the ROUGE package indicates our summarization method is competitive with the state of the art in single document summarization.

6 Pages

Additional Publication Information: Published and presented at the First International Conference on HCI, Allahabad, India. Jan 20-23, 2009

External Posting Date: February 21, 2009 [Fulltext]. Approved for External Publication
Internal Posting Date: February 21, 2009 [Fulltext]

Back to Index