An Improved Method for Visual Word Generation Based on Kernel Function

Abstract:

This paper studies a novel visual word generation method for the Bag-of-words model in object categorization. The conventional Bag-of-words algorithm uses cluster centers as visual words, which leads to an incomplete expression of image semantic information. To address this, an improved visual word generation method using a soft decision based on a kernel function is proposed. First, SIFT keypoints are extracted from the images. Then, after the SIFT keypoints are clustered, typical keypoints are selected from each cluster by kernel density estimation with a kernel function. Finally, the selected keypoints are used to train an SVM, which generates the visual word for that cluster. Experimental results show that the proposed method enhances the expression of image semantic information, effectively increases the recall ratio, and significantly improves object categorization performance.
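The selection step in the abstract (choosing typical keypoints from a cluster by kernel density estimation) can be illustrated with a minimal sketch. The abstract does not state which kernel, bandwidth, or selection rule is used, so the Gaussian kernel, the `bandwidth` value, and the "keep the top `keep` points by density" rule below are all assumptions, and the function names are hypothetical. SIFT extraction, clustering, and SVM training are omitted; toy 2-D points stand in for 128-D SIFT descriptors.

```python
import math


def gaussian_kernel(u, v, bandwidth):
    """Gaussian kernel of the Euclidean distance between two descriptors (assumed kernel)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def kernel_density(points, bandwidth):
    """Unnormalized kernel density estimate evaluated at each point of one cluster."""
    n = len(points)
    return [
        sum(gaussian_kernel(p, q, bandwidth) for q in points) / n
        for p in points
    ]


def select_typical(points, bandwidth, keep):
    """Return the `keep` points with the highest estimated density (assumed rule)."""
    density = kernel_density(points, bandwidth)
    ranked = sorted(range(len(points)), key=lambda i: density[i], reverse=True)
    return [points[i] for i in ranked[:keep]]


# Toy cluster: four tightly grouped descriptors and one outlier.
cluster = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (3.0, 3.0)]
typical = select_typical(cluster, bandwidth=0.5, keep=4)
```

In this toy cluster the isolated point receives a much lower density than the four grouped points, so it is not selected. In the pipeline the abstract describes, the selected keypoints of each cluster would then be passed to an SVM to form that cluster's visual word.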

Info:

Pages:

166-169

Online since:

October 2011

Copyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved
