Evaluation of Near-Duplicate Image Retrieval Algorithms for the Identification of Celebrities in Web Images

Article Preview

Abstract:

Near-duplicate image retrieval is a classical research problem in computer vision, for which a large number of diverse approaches have been proposed. Recent studies have revealed that it can be used as an intermediate step to implement search-based celebrity identification given the existence of huge volume of user-tagged or text-surrounded celebrity images on the web. However, the effectiveness of existing near-duplicate image retrieval methods for such a task still remains unclear. To address this issue, this paper presents a comprehensive study of the existing near-duplicate image retrieval methods in a structural way. Four representatives of the existing methods, i.e. hash signature, mean SSIM, BoVW with SIFT features and ARG, are experimentally evaluated using a self-constructed dataset containing 24762 images of 15 top searched celebrities collected using 6 news search engines and the Google image search engine. The experimental results reveal that, compared with global feature based methods, local feature based ones are usually more appropriate for the task of celebrity identification in web images, as they can deal with partial duplicate and scene similar images better. In particular, BoVW with SIFT features is recommended as it provides the best trade-off between on-line speed and retrieval accuracy.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 765-767)

Pages:

1431-1435

Citation:

Online since:

September 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] S. Avidan, et al., Internet Vision (Scanning the Spec. Issue), Proc. of the IEEE., vol. 98, pp.1367-1369, (2010).

Google Scholar

[2] X. Zhang. et al., Finding Celebrities in Billions of Web Images, IEEE Trans. on Multimedia., vol. 14, pp.995-1007, (2012).

DOI: 10.1109/tmm.2012.2186121

Google Scholar

[3] X. J. Wang, et al., Duplicate-Search-Based Image Annotation Using Web-Scale Data, Proceedings of the IEEE, vol. 100, pp.2705-2721, (2012).

DOI: 10.1109/jproc.2012.2193109

Google Scholar

[4] B. Wang. et al., Large-Scale Duplicate Detection for Web Image Search, in IEEE Intl. Conf. on Multimedia and Expo (ICME). 2006, pp.353-356.

Google Scholar

[5] Z. Wang, et al., Image Quality Assessment From Error Visibility to Structural Similarity, IEEE TRANSACTIONS ON IMAGE PROCESSING, vol. 13, pp.600-612, Apr. (2004).

DOI: 10.1109/tip.2003.819861

Google Scholar

[6] W. L. Zhao, X. Wu and C. W. Ngo. On the Annotation of Web Videos by Efficient Near-duplicate Search, IEEE Trans. on Multimedia, vol. 12, no. 5, pp.448-461, (2010).

DOI: 10.1109/tmm.2010.2050651

Google Scholar

[7] D. Zhang. Statistical part-based models: theory and applications in image similarity, object detection and region labeling, Columbia University, (2006).

Google Scholar

[8] Z. Wu, et al., Bundling features for large scale partial-duplicate web image search, in IEEE Conf. on Computer Vision and Pattern Recog. (CVPR). 2009, pp.25-32.

DOI: 10.1109/cvpr.2009.5206566

Google Scholar