Issue No. 01 - Jan. (2014 vol. 26)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2012.234
Alessia Albanese , Dept. of Appl. Sci., Univ. of Naples Parthenope, Naples, Italy
Sankar K. Pal , Indian Stat. Inst., Kolkata, India
Alfredo Petrosino , Dept. of Appl. Sci., Univ. of Naples Parthenope, Naples, Italy
Nowadays, the high availability of data gathered from wireless sensor networks and telecommunication systems has drawn the attention of researchers on the problem of extracting knowledge from spatiotemporal data. Detecting outliers which are grossly different from or inconsistent with the remaining spatiotemporal data set is a major challenge in real-world knowledge discovery and data mining applications. In this paper, we deal with the outlier detection problem in spatiotemporal data and describe a rough set approach that finds the top outliers in an unlabeled spatiotemporal data set. The proposed method, called Rough Outlier Set Extraction (ROSE), relies on a rough set theoretic representation of the outlier set using the rough set approximations, i.e., lower and upper approximations. We have also introduced a new set, named Kernel Set, that is a subset of the original data set, which is able to describe the original data set both in terms of data structure and of obtained results. Experimental results on real-world data sets demonstrate the superiority of ROSE, both in terms of some quantitative indices and outliers detected, over those obtained by various rough fuzzy clustering algorithms and by the state-of-the-art outlier detection methods. It is also demonstrated that the kernel set is able to detect the same outliers set but with less computational time.
Approximation methods, Set theory, Kernel, Knowledge engineering, Data engineering, Data mining, Uncertainty
A. Albanese, S. K. Pal and A. Petrosino, "Rough Sets, Kernel Set, and Spatiotemporal Outlier Detection," in IEEE Transactions on Knowledge & Data Engineering, vol. 26, no. 1, pp. 194-207, 2013.