This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Semisupervised Query Expansion with Minimal Feedback
November 2007 (vol. 19 no. 11)
pp. 1585-1589
Query expansion is an information retrieval technique in which new query terms are selected to improve search performance. Although useful terms can be extracted from documents whose relevance is already known, it is difficult to get enough of such feedback from a user in actual use. We propose a query expansion method that performs well even if a user makes practically minimum effort, that is, chooses only a single relevant document. To improve searches in these conditions, we made two refinements to a well-known query expansion method. One uses transductive learning to obtain pseudo relevant documents, thereby increasing the total number of source documents from which expansion terms can be extracted. The other is a modified parameter estimation method that aggregates the predictions of multiple learning trials to sort candidate terms for expansion by importance. Experimental results show that our method outperforms traditional methods, and is comparable to a state of the art method.

[1] I. Ruthven, “Re-Examining the Potential Effectiveness of Interactive Query Expansion,” Proc. 26th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 213-220, 2003.
[2] S. Dumais et al., “Sigir 2003 Workshop Report: Implicit Measures of User Interests and Preferences,” Proc. SIGIR Forum, 2003.
[3] S.E. Robertson and S. Walker, “On Relevance Weights with Little Relevance Information,” Proc. 20th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 16-24, 1997.
[4] M. Iwayama, “Relevance Feedback with a Small Number of Relevance Judgements: Incremental Relevance Feedback vs. Document Clustering,” Proc. 23rd Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 10-16, 2000.
[5] T. Onoda et al., “Non-Relevance Feedback Document Retrieval,” Proc. 2004 IEEE Conf. Cybernetics and Intelligent Systems, pp. 456-461, 2004.
[6] F. Diaz and D. Metzler, “Improving the Estimation of Relevance Models Using Large External Corpora,” Proc. 29th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp.154-161, 2006.
[7] X. Shen, B. Tan, and C. Zhai, “Context-Sensitive Information Retrieval Using Implicit Feedback,” Proc. 28th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp.43-50, 2005.
[8] S.E. Robertson, “Overview of the Okapi Projects,” J. Documentation, vol. 53, no. 1, pp. 3-7, 1997.
[9] G. Salton, “The Smart Document Retrieval Project,” Proc. 14th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 356-358, 1991.
[10] V. Vapnik, Statistical Learning Theory. Wiley, 1998.
[11] G.W. Flake et al., “Extracting Query Modification from Nonlinear SVMS,” Proc. 11th Int'l World Wide Web Conf., 2002.
[12] S. Oyama et al., “Keyword Spices: A New Method for Building Domain-Specific Web Search Engines,” Proc. 17th Int'l Joint Conf. Artificial Intelligence, 2001.
[13] R.W. White and D. Kelly, “A Study on the Effects of Personalization and Task Information on Implicit Feedback Performance,” Proc. 15th ACM Int'l Conf. Information and Knowledge Management (CIKM '06), pp. 297-306, 2006.
[14] A. Amir, M. Berg, and H. Permuter, “Mutual Relevance Feedback for Multimodal Query Formulation in Video Retrieval,” Proc. Seventh ACM SIGMM Int'l Workshop Multimedia Information Retrieval (MIR '05), pp. 17-24, 2005.
[15] S. Yu et al., “Improving Pseudo-Relevance Feedback in Web Information Retrieval Using Web Page Segmentation,” Proc. 12th Int'l World Wide Web Conf., 2003.
[16] S.E. Robertson, “On Term Selection for Query Expansion,” J.Documentation, vol. 46, no. 4, pp. 359-364, 1990.
[17] T. Joachims, “Transductive Inference for Text Classification Using Support Vector Machines,” Proc. Int'l Conf. Machine Learning (ICML '99), 1999.
[18] X. Zhu et al., “Semi-Supervised Learning Using Gaussian Fields and Harmonic Functions,” Proc. 20th Int'l Conf. Machine Learning, pp. 912-914, 2003.
[19] A. Blum et al., “Semi-Supervised Learning Using Randomized Mincuts,” Proc. Int'l Conf. Machine Learning (ICML '04), 2004.
[20] T. Joachims, “Transductive Learning via Spectral Graph Partitioning,” Proc. Int'l Conf. Machine Learning (ICML '03), pp. 143-151, 2003.
[21] M. Okabe, K. Umemura, and S. Yamada, “Query Expansion with the Minimum User Feedback by Transductive Learning,” Proc. Human Language Technology Conf./Conf. Empirical Methods in Natural Language Processing, pp. 963-970, 2005.
[22] C. Buckley, G. Salton, and J. Allan, “The Effect of Adding Relevance Information in a Relevance Feedback Environment,” Proc. 14th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 292-300, 1994.
[23] E. Voorhees and D. Harman, “Overview of the Eighth Text Retrieval Conf.,” Proc. Eighth Text REtrieval Conf. (TREC '98), 1999.
[24] S.E. Robertson and S. Walker, “Okapi/Keenbow at TREC-8,” Proc. Eighth Text REtrieval Conf. (TREC '99), 1999.
[25] A. Singhal, J. Choi, D. Hindle, D.D. Lewis, and F. Pereira, “AT&T at TREC-7,” Proc. Seventh Text REtrieval Conf. (TREC '98), 1998.
[26] C. Zhai and J. Lafferty, “Model-Based Feedback in the Language Modeling Approach to Information Retrieval,” Proc. 10th Int'l Conf. Information and Knowledge Management (CIKM '01), pp. 403-410, 2001.

Index Terms:
Information Search and Retrieval, Query formulation, H.3.3.f Relevance feedback,Machine learning
Citation:
Masayuki Okabe, Seiji Yamada, "Semisupervised Query Expansion with Minimal Feedback," IEEE Transactions on Knowledge and Data Engineering, vol. 19, no. 11, pp. 1585-1589, Nov. 2007, doi:10.1109/TKDE.2007.190646
Usage of this product signifies your acceptance of the Terms of Use.