Jump to content United States-English
HP.com Home Products and Services Support and Drivers Solutions How to Buy
» Contact HP

HP.com home


Technical Reports



» 

HP Labs

» Research
» News and events
» Technical reports
» About HP Labs
» Careers @ HP Labs
» People
» Worldwide sites
» Downloads
Content starts here

 
Click here for full text: PDF

Extracting and Re-using Structured Data from Wikis

Isbell, Jonathan; Butler Mark H.

HPL-2007-182

Keyword(s): wikipedia; semantic web; information extraction; wikis; metadata

Abstract: This report investigates simplifying the creation of structured data for use in Semantic Web applications. In the first phase of work, a prototype is created that extracts structured data on companies and unstructured data on acquisitions from Wikipedia. It then reuses this information in a data browser that can provide faceted, map and timeline views. In the second phase, we investigate more generic approaches for extracting structured data and related schema information from Wikipedia. We use this information to create user interfaces that simplify the creation of structured data about related topics. This demonstrates that it is possible to simplify the creation and re-use of structured data in ways that benefit users.

21 Pages

Back to Index

»Technical Reports

» 2009
» 2008
» 2007
» 2006
» 2005
» 2004
» 2003
» 2002
» 2001
» 2000
» 1990 - 1999

Heritage Technical Reports

» Compaq & DEC Technical Reports
» Tandem Technical Reports
Printable version
Privacy statement Using this site means you accept its terms Feedback to HP Labs
© 2009 Hewlett-Packard Development Company, L.P.