Technical Reports

HPL-2010-123

Semantic Analysis of Web Site Changes

Manjunath, Geetha; Gupta, Divanshu
HP Laboratories

HPL-2010-123

Keyword(s): Web portals, change analysis, hypertext, DOM, widgets

Abstract: Today, as the Internet has become the most prominent source of information, many desktop and mobile applications are built primarily based on website content. Web Widgets, which are single function web applications, are one such popular class applications. These applications provide the user with specific information from websites and need to periodically refresh the dynamic information published on the portals. Deligent use of networking resources, particularly on mobile devices, require tools that monitor websites and check whether the dependent web fragment has changed. We have devised an efficient mechanism to monitor websites for semantic changes and notification based on type of change between two versions of websites. In this paper, we present our new algorithm to compute web page differences and also share the results of an empirical study of semantic analysis of the changing behavior of different category of websites.

6 Pages

External Posting Date: September 21, 2010 [Abstract Only]. Approved for External Publication - External Copyright Consideration
Internal Posting Date: September 21, 2010 [Fulltext]

Back to Index