Sixth IEEE International Conference on Data Mining (ICDM'06)
Similarity of Temporal Query Logs Based on ARIMA Model
Hong Kong
December 18-December 22
ISBN: 0-7695-2701-9
Ning Liu, Microsoft Research Asia, China
Shuzhen Nong, Microsoft AdCenter, USA
Jun Yan, Microsoft Research Asia, China
Benyu Zhang, Microsoft Research Asia, China
Zheng Chen, Microsoft Research Asia, China
Ying Li, Microsoft AdCenter, USA
A challenging issue in modern information retrieval is determining and satisfying users' requirements from very short text queries alone. In this paper, we propose an algorithm that finds related queries using the Auto-Regressive Integrated Moving Average (ARIMA) model. First, we select and estimate an ARIMA model for each temporal query log, so that each query is represented by a sequence of coefficients. We then use the correlation of the ARIMA coefficients as the similarity measure, which we call the ARIMA Temporal Similarity (ARIMA TS). This similarity describes how strongly two time series are linearly related. The ARIMA model can also be treated as a dimensionality reduction procedure, which saves storage space for a large database of query logs. In addition, the ARIMA model can be used to predict the trend of a query. Experimental results on two query logs of the MSN search engine demonstrate that the proposed approach achieves better similarity measurement efficiently.
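The pipeline in the abstract (fit a model per query, keep only its coefficients, correlate the coefficient vectors) can be illustrated with a minimal sketch. The abstract does not state how the model order is selected, how the coefficients are estimated, or how moving-average terms are handled, so everything below is an assumption: the sketch fits only the autoregressive part on a differenced series, that is ARIMA(p, d, 0), using Yule-Walker estimation, and takes the Pearson correlation of the resulting coefficients. The function names (`difference`, `ar_coefficients`, `arima_ts`) and the defaults `p=4`, `d=1` are hypothetical and not taken from the paper.

```python
import math
import random


def difference(series, d=1):
    """Apply d rounds of first differencing (the 'I' part of ARIMA)."""
    out = list(series)
    for _ in range(d):
        out = [b - a for a, b in zip(out, out[1:])]
    return out


def autocovariance(x, max_lag):
    """Biased sample autocovariances for lags 0..max_lag."""
    n = len(x)
    mean = sum(x) / n
    c = [v - mean for v in x]
    return [sum(c[t] * c[t + k] for t in range(n - k)) / n
            for k in range(max_lag + 1)]


def ar_coefficients(x, p):
    """Estimate AR(p) coefficients by solving the Yule-Walker equations
    with the Levinson-Durbin recursion (assumed estimator, not the paper's)."""
    r = autocovariance(x, p)
    if r[0] == 0:
        raise ValueError("series is constant; AR coefficients are undefined")
    phi, err = [], r[0]
    for k in range(1, p + 1):
        acc = r[k] - sum(phi[j] * r[k - 1 - j] for j in range(k - 1))
        reflection = acc / err
        phi = [phi[j] - reflection * phi[k - 2 - j] for j in range(k - 1)]
        phi.append(reflection)
        err *= 1.0 - reflection ** 2
    return phi


def pearson(a, b):
    """Pearson correlation of two equal-length vectors."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)


def arima_ts(series_a, series_b, p=4, d=1):
    """Similarity of two query-frequency series: correlation of their
    AR coefficients after differencing (hypothetical ARIMA(p, d, 0) setup)."""
    coef_a = ar_coefficients(difference(series_a, d), p)
    coef_b = ar_coefficients(difference(series_b, d), p)
    return pearson(coef_a, coef_b)


# Synthetic daily query counts, invented purely for illustration.
rng = random.Random(0)
query_a = [100 + 10 * math.sin(t / 7.0) + rng.gauss(0, 1) for t in range(200)]
query_b = [50 + 5 * math.sin(t / 7.0) + rng.gauss(0, 1) for t in range(200)]
similarity = arima_ts(query_a, query_b)
```

Each series of 200 observations is reduced to `p` coefficients, which is the dimensionality reduction the abstract mentions. With only a handful of coefficients per query the correlation is itself a noisy estimate, so the choice of `p` matters in practice.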
Citation:
Ning Liu, Shuzhen Nong, Jun Yan, Benyu Zhang, Zheng Chen, Ying Li, "Similarity of Temporal Query Logs Based on ARIMA Model," Sixth IEEE International Conference on Data Mining (ICDM'06), pp. 975-979, 2006