Informatics, Balkan Conference in (2009)
Sept. 17, 2009 to Sept. 19, 2009
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/BCI.2009.22
In this paper, we present a new method that accelerates similarity search implemented via one-nearest neighbor on time series data. The main idea is to identify the most similar time series to a given query without necessarily searching over the whole database. Our method is based on partitioning the search space by applying the K-means algorithm on the data. Then, similarity search is performed hierarchically starting from the cluster that lies most closely to the query. This procedure aims at reaching the most similar series without searching all clusters. In this work, we propose to reduce the intrinsically high dimensionality of time series prior to clustering by applying a well known dimensionality reduction technique, namely, the Piecewise Aggregate Approximation, for its simplicity and efficiency. Experiments are conducted on twelve real-world and synthetic datasets covering a wide range of applications.
similarity search, clustering, time series, data mining
G. Evangelidis and L. Karamitopoulos, "Cluster-Based Similarity Search in Time Series," Informatics, Balkan Conference in(BCI), Thessaloniki, Greece, 2009, pp. 113-118.