Technical Reports
HPL-2009-39
Document summarization using Wikipedia
Ramanathan, Krishnan; Sankarasubramaniam, Yogesh; Mathur, Nidhi; Gupta, Ajay
HP Laboratories
HPL-2009-39
Keyword(s): Single Document Summarization, Wikipedia, ROUGE
Abstract: Although most of the developing world is likely to first access the Internet through mobile phones, mobile devices are constrained by screen space, bandwidth and limited attention span. Single document summarization techniques have the potential to simplify information consumption on mobile phones by presenting only the most relevant information contained in the document. In this paper we present a language independent single-document summarization method. We map document sentences to semantic concepts in Wikipedia and select sentences for the summary based on the frequency of the mapped-to concepts. Our evaluation on English documents using the ROUGE package indicates our summarization method is competitive with the state of the art in single document summarization.
6 Pages
Additional Publication Information: Published and presented at the First International Conference on HCI, Allahabad, India. Jan 20-23, 2009
External Posting Date: February 21, 2009 [Fulltext]. Approved for External Publication
Internal Posting Date: February 21, 2009 [Fulltext]