Bibliographic Details
| Title: |
Distributed and Collaborative Web Change Detection System. |
| Authors: |
Prieto, Víctor M.1 victor.prieto@udc.es, Álvarez, Manuel1 manuel.alvarez@udc.es, Carneiro, Víctor1 victor.carneiro@udc.es, Cacheda, Fidel1 fidel.cacheda@udc.es |
| Source: |
Computer Science & Information Systems. Jan2015, Vol. 12 Issue 1, p91-114. 24p. |
| Subjects: |
Internet searching, Website access control, Search engines, Electronic indexes, Internet content, Downloading |
| Abstract: |
Search engines use crawlers to traverse the Web in order to download web pages and build their indexes. Maintaining these indexes up-to-date is an essential task to ensure the quality of search results. However, changes in web pages are unpredictable. Identifying the moment when a web page changes as soon as possible and with minimal computational cost is a major challenge. In this article we present theWeb Change Detection system that, in a best case scenario, is capable to detect, almost in real time, when a web page changes. In a worst case scenario, it will require, on average, 12 minutes to detect a change on a low PageRank web site and about one minute on a web site with high PageRank. Meanwhile, current search engines require more than a day, on average, to detect a modification in a web page (in both cases). [ABSTRACT FROM AUTHOR] |
|
Copyright of Computer Science & Information Systems is the property of ComSIS Consortium and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) |
| Database: |
Engineering Source |