loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
17th International Conference on Data Engineering (ICDE'01)
Duality-Based Subsequence Matching in Time-Series Databases
Heidelberg, Germany
April 02-April 06
ISBN: 0-7695-1001-9
Yang-Sae Moon, Korea Advanced Institute of Science and Technology (KAIST)
Kyu-Young Whang, Korea Advanced Institute of Science and Technology (KAIST)
Woong-Kee Loh, Korea Advanced Institute of Science and Technology (KAIST)
Abstract: In this paper, we propose a new subsequence matching method, Dual Match, which exploits duality in constructing windows and significantly improves performance. Dual Match divides data sequences into disjoint windows and the query sequence into sliding windows, and thus, is a dual approach of the one by Faloutsos et al. (FRM in short), which divides data sequences into sliding windows and the query sequence into disjoint windows. We formally prove that our dual approach is correct, i.e., it incurs no false dismissal. We also prove that, given the minimum query length, there is a maximum bound of the window size to guarantee correctness of Dual Match and discuss the effect of the window size on performance. FRM causes a lot of false alarms (i.e., candidates that do not qualify) by storing minimum bounding rectangles rather than individual points representing windows to avoid excessive storage space required for the index. Dual Match solves this problem by directly storing points, but without incurring excessive storage overhead. Experimental results show that, in most cases, Dual Match provides large improvement in both false alarms and performance over FRM, given the same amount of storage space. In particular, for low selectivities (less than 10^{-4}), Dual Match significantly improves performance up to 430-fold. On the other hand, for high selectivities (more than 10^{-2}), it shows a very minor degradation (less than 29%). For selectivities in between (10^{-4} \approx 10^{-2}), Dual Match shows performance slightly better than that of FRM. Dual Match is also 4.10 \approx 25.6 times faster than FRM in building indexes of approximately the same size. Overall, these results indicate that our approach provides a new paradigm in subsequence matching that improves performance significantly in large database applications.
Citation:
Yang-Sae Moon, Kyu-Young Whang, Woong-Kee Loh, "Duality-Based Subsequence Matching in Time-Series Databases," icde, pp.0263, 17th International Conference on Data Engineering (ICDE'01), 2001
Usage of this product signifies your acceptance of the Terms of Use.