Techniques for Refreshing Images in Web Documents

Article Preview

Abstract:

In this paper, we put forward a technique for keeping web pages up-to-date, later used by search engine to serve the end user queries. A major part of the Web is dynamic and hence, a need arises to constantly update the changed web documents in search engine’s repository. In this paper we used the client-server architecture for crawling the web and propose a technique for detecting changes in web page based on the content of the images present if any in web documents. Once it is being identified that the image embedded in the web document is changed then the previous copy of the web document present in the search engine’s database/repository is replaced with the changed one.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 403-408)

Pages:

1008-1013

Citation:

Online since:

November 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Z. Dalal, S. Dash, P. Dave, L. Francisco- Revilla, R. Furuta, U. Karadkar, and F. Shipman, Managing Distributed Collections: Evaluating Web Page Changes, Movement, and Replacement, Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries, pp.160-168, June (2004).

DOI: 10.1145/996350.996387

Google Scholar

[2] J. Cho, and H.G. -Molina, The Evolution of the Web and Implications for an Incremental Crawler, Proceedings of the 26th International Conference on Very Large Data Bases, p.200 – 209, (2000).

Google Scholar

[3] J. Cho, and Hector Garcia- Molina, Estimating Frequency of Change, ACM transaction on internet technology, vol3, issue 3, pp.256-290, Aug (2003).

DOI: 10.1145/857166.857170

Google Scholar

[4] F. Douglis and T. Ball, Tracking and viewing changes on the web, In USENIX Annual Technical Conference, p.165– 176, (1996).

Google Scholar

[5] D. Buttler, D. Rocco and L. Liu, Efficient Web Change Monitoring with Page Digest, In Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters, p.476 – 477, (2004).

DOI: 10.1145/1013367.1013533

Google Scholar

[6] D. Yadav, A.K. Sharma & J.P. Gupta, Architecture for parallel crawler and algorithm for web page change detection, IEEE Proceeding of 10th International Conference on IT, Rourkela, India, pp.258-264, Dec 17-20, (2007).

Google Scholar

[7] R. Vincent, and O. Folorunso, A Descriptive Algorithm for Sobel Image Edge Detection, Proceedings of Informing Science & IT Education Conference (InSITE) (2009).

DOI: 10.28945/3351

Google Scholar

[8] G. Pass, R. Zabih, and J. Miller, Comparing images using colour coherent vectors, Proceedings of Informing Science & IT Education Conference (InSITE) (2009).

Google Scholar

[9] Cho, Junghoo, and Garcia-Molina Hector, Parallel Crawlers", Proceedings of the 11th international conference on World Wide Web WWW '02, Honolulu, Hawaii, USA, ACM Press, p.124 – 135, (2002).

DOI: 10.1145/511446.511464

Google Scholar