Improving Relevance Feedback via Using Support Vector Machines

Article Preview

Abstract:

Traditional relevance feedback technique could help improve retrieval performance. It usually utilize the most frequent terms in the relevant documents to enrich the user’s initial query. We re-examine this method and find that many expansion terms identified in traditional approaches are indeed unrelated to the query and harmful to the retrieval. This paper introduces a Support Vector Machines Based method to improve the retrieval results. Firstly, the classifier is trained on the feedback documents. Then, we can utilize this classifier to classify the rest of the documents and move relevant documents to the front of irrelevant documents. This new approach avoids modifying the initial query, so it’s a new direction for the relevance feedback techniques. Our Experiments on TREC dataset demonstrate that retrieval effectiveness can be improved more than 24.37% when our proposed approach is used.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 255-260)

Pages:

2028-2032

Citation:

Online since:

May 2011

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] AMO P, FERRERAS F L, CRUZ F, et al. Smoothing functions for automatic relevance feedback in information retrieval. Proc of the 11th International Workshop on Database and Expert Systems Applications. 2000: 115-119.

DOI: 10.1109/dexa.2000.875013

Google Scholar

[2] A. Spink, B. J. Jansen, D. Wolfram, and T. Saracevic. From e-sex to e-commerce: Web search changes. IEEE Computer, 35(3): 107-109, 2002.

DOI: 10.1109/2.989940

Google Scholar

[3] Bishop, C. Patten recognition and machine learning. Berlin: Springer-Verlag, 2006.

Google Scholar

[4] G. Salton and M. J. McGill. Introduction to Modern Information Retrieval. New York: McGraw-Hill, 1983.

Google Scholar

[5] Guihong Cao, Jian-Yun Nie, Jianfeng Gao, Stephen Robertson. Selecting good expansion terms for pseudo-relevance feedback. Proc of ACM SIGIR Conference on Research and Development in Information Retrieval. 2008.

DOI: 10.1145/1390334.1390377

Google Scholar

[6] Hsu CW, Chang CC, Lin CJ. A practical guide to support vector classification. 2003-08-10/2004-11-10, http://www.csie.ntu.edu.tw/-cjlin/papers/guide/guide.pdf

Google Scholar

[7] IWAYAMA M. Relevance feedback with a small number of relevance judgements. Proc of ACM SIGIR Conference on Research and Development in Information Retrieval. 2000: 10-16.

DOI: 10.1145/345508.345538

Google Scholar

[8] Rocchio J J. Relevance feedback in information retrieval//Salton G. The Smart Retrieval System: Experiments in Automatic Document Processing. New Jersey: Prentice Hall, 1971: 313-323.

Google Scholar

[9] TAN Bin, VELIVELLIA, FANG Hui, et al. Term feedback for information retrieval with language models. Proc of ACM SIGIR Conference on Research and Development in Information Retrieval. 2007: 263-270.

DOI: 10.1145/1277741.1277788

Google Scholar

[10] V. Vapnik. Statistical Learning Theory. New York: John Wiley & Sons Inc, 1998.

Google Scholar

[11] X. Shen and C. Zhai. Active feedback in ad hoc information retrieval. Proc of ACM SIGIR conference on Research and development in information retrieval, pages 55–66, March 2005.

DOI: 10.1145/1076034.1076047

Google Scholar