This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Enabling Concept-Based Relevance Feedback for Information Retrieval on the WWW
July/August 1999 (vol. 11 no. 4)
pp. 595-609

Abstract—The World Wide Web is a world of great richness, but finding information on the Web is also a great challenge. Keyword-based querying has been an immediate and efficient way to specify and retrieve related information that the user inquires. However, conventional document ranking based on an automatic assessment of document relevance to the query may not be the best approach when little information is given, as in most cases. In order to clarify the ambiguity of the short queries given by users, we propose the idea of concept-based relevance feedback for Web information retrieval. The idea is to have users give two to three times more feedback in the same amount of time that would be required to give feedback for conventional feedback mechanisms. Under this design principle, we apply clustering techniques to the initial search results to provide concept-based browsing. We show the performances of various feedback interface designs and compare their pros and cons. We shall measure precision and relative recall to show how clustering improves performance over conventional similarity ranking and, most importantly, we shall show how the assistance of concept-based presentation reduces browsing labor.

[1] P.G. Anick and S. Vaithyanathan, “Exploiting Clustering and Phrases for Context-Based Information Retrieval,” Proc. ACM SIGIR Int'l Conf. Research and Development in Information Retrieval, pp. 314–323, 1997.
[2] C.H. Chang and C.C. Hsu, “A Customizable Multiengine Search Tool with Clustering,” Computer Networks and ISDN Systems, vol. 29, nos. 8-13, pp. 1,217–1,224, Sept. 1997; also appeared in Proc. Sixth Int'l WWW Conf., Apr. 1997.
[3] C.H. Chang and C.C. Hsu, “Exploiting Hyperlinks for Automatic Information Discovery on the WWW,” Proc. 10th IEEE Int'l Conf. Tools with Artificial Intelligence, Chien Tan Youth Activity Center, Taipei, Taiwan, Nov. 1998.
[4] W.B. Croft, R. Cook, and D. Wilder, “Providing Government Information on the Internet: Experiences with Thomas,” Proc. Digital Libraries Conf., pp. 19–25, 1995.
[5] D.R. Cutting et al., “Scatter/Gather: A Cluster-Base Approach to Browsing Large Document Collections,” Proc. ACM SIGIR '92, 1992.
[6] D. Dreilinger and A. Howe, “An Information Gathering Agent for Querying Web Search Engines,” Technical Report CS-96-111, Colorado State Univ., 1996.
[7] D. Harman, “Relevance Feedback Revisited,” Proc. 15th Int'l Conf. Research and Development in Information Retrieval, pp. 1-10, 1992.
[8] M.A. Hearst and J.O. Pedersen, “Reexamining the Cluster Hypothesis: Scatter/Gather on Retrieval Results,” Proc. ACM SIGIR Int'l Conf. Research and Development in Information Retrieval, 1996.
[9] B. Pinkerton, “Finding What People Want: Experiences with the WebCrawler,” Proc. Second Int'l WWW Conf., Chicago, 1994.
[10] M.F. Porter, “An Algorithm for Suffix Stripping,” Program, vol. 14, no. 3, pp. 130–137, July 1980.
[11] E. Rasmussen, "Clustering Algorithms," Information Retrieval: Data Structures and Algorithms, W.B. Frakes and R. Baeza-Yates, eds. Englewood Cliffs, N.J.: Prentice Hall, 1992.
[12] S.E. Robertson and K. Sparck Jones, “Relevance Weighting of Search Terms,” J. Am. Soc. Information Science, vol. 27, no. 3, pp. 129–146, 1976.
[13] J.J. Rocchio, “Relevance Feedback in Information Retrieval,” G. Salton, ed., SMART Retrieval System, Prentice Hall, pp. 313–323, 1971.
[14] E. Selberg and O. Etzioni, “Multi-Engine Search and Comparison Using the MetaCrawler,” Proc. Fourth Int'l WWW Conf., Boston, 1995.
[15] E. Voorhees, “Query Expansion Using Lexical-Semantic Relations,” Proc. ACM SIGIR Int'l Conf. Research and Development in Information Retrieval, pp. 61-69, 1994.
[16] P. Willett, “Recent Trends in Hierarchical Document Clustering: A Critical Review,” Information Processing and Management, vol. 24, no. 5, pp. 577–597, 1988.
[17] J. Xu and W.B. Croft, “Query Expansion Using Local and Global Document Analysis,” Proc. ACM SIGIR Int'l Conf. Research and Development in Information Retrieval, pp. 4-11, 1996.

Index Terms:
Query expansion, relevance feedback, concept-based feedback, keyword extraction, document clustering, document-based browsing, cluster-based browsing.
Citation:
Chia-Hui Chang, Ching-Chi Hsu, "Enabling Concept-Based Relevance Feedback for Information Retrieval on the WWW," IEEE Transactions on Knowledge and Data Engineering, vol. 11, no. 4, pp. 595-609, July-Aug. 1999, doi:10.1109/69.790812
Usage of this product signifies your acceptance of the Terms of Use.