Scientific and Statistical Database Management, International Conference on (2006)
July 3, 2006 to July 5, 2006
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/SSDBM.2006.22
Lei Chen , Hong Kong University of Science and Technology
Xiang Lian , Hong Kong University of Science and Technology
Due to the wide usage of stream time series, an efficient and effective similarity search over stream data becomes essential for many applications. Although many approaches have been proposed for searching through archived data, because of the unique characteristics of the stream, for example, data are frequently updated, traditional methods may not work for the stream time series. Especially, for the cases where the arrival of data is often delayed for various reasons, for example, the communication congestion or batch processing and so on, queries on such incomplete time series or even future time series may result in inaccuracy. Therefore, in this paper we propose two approaches, polynomial and probabilistic, to predict the unknown values that have not arrived at the system. We also present efficient indexes, that is, a multidimensional hash index and B+-tree, to facilitate the prediction and similarity search on future time series, respectively. Extensive experiments demonstrate the efficiency and effectiveness of our methods in terms of I/O, prediction and query accuracy.
Lei Chen, Xiang Lian, "Efficient Methods on Predictions for Similarity Search over Stream Time Series", Scientific and Statistical Database Management, International Conference on, vol. 00, no. , pp. 241-250, 2006, doi:10.1109/SSDBM.2006.22