This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Query Representation through Lexical Association for Information Retrieval
Dec. 2012 (vol. 24 no. 12)
pp. 2260-2273
Pawan Goyal, University of Ulster, Londonderry
Laxmidhar Behera, Indian Institute of Technology, Kanpur and University of Ulster, Londonderry
Thomas Martin McGinnity, University of Ulster, Londonderry
A user query for information retrieval (IR) applications may not contain the most appropriate terms (words) as actually intended by the user. This is usually referred to as the term mismatch problem and is a crucial research issue in IR. Using the notion of relevance, we provide a comprehensive theoretical analysis of a parametric query vector, which is assumed to represent the information needs of the user. A lexical association function has been derived analytically using the system relevance criteria. The derivation is further justified using an empirical evidence from the user relevance criteria. Such analytical derivation as presented in this paper provides a proper mathematical framework to the query expansion techniques, which have largely been heuristic in the existing literature. By using the generalized retrieval framework, the proposed query representation model is equally applicable to the vector space model (VSM), Okapi best matching 25 (Okapi BM25), and Language Model (LM). Experiments over various data sets from TREC show that the proposed query representation gives statistically significant improvements over the baseline Okapi BM25 and LM as well as other well-known global query expansion techniques. Empirical results along with the theoretical foundations of the query representation confirm that the proposed model extends the state of the art in global query expansion.
Index Terms:
Mathematical model,Equations,Correlation,Information retrieval,Context,Markov processes,Indexes,language model,Information retrieval,lexical association,query expansion
Citation:
Pawan Goyal, Laxmidhar Behera, Thomas Martin McGinnity, "Query Representation through Lexical Association for Information Retrieval," IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 12, pp. 2260-2273, Dec. 2012, doi:10.1109/TKDE.2011.171
Usage of this product signifies your acceptance of the Terms of Use.