loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fourth IEEE International Conference on Data Mining (ICDM'04)
RDF: A Density-Based Outlier Detection Method using Vertical Data Representation
Brighton, United Kingdom
November 01-November 04
ISBN: 0-7695-2142-8
Dongmei Ren, North Dakota State University, Fargo
Baoying Wang, North Dakota State University, Fargo
William Perrizo, North Dakota State University, Fargo
Outlier detection can lead to discovering unexpected and interesting knowledge, which is critical important to some areas such as monitoring of criminal activities in electronic commerce, credit card fraud, etc. In this paper, we developed an efficient density-based outlier detection method for large datasets. Our contributions are: a) We introduce a relative density factor (RDF); b) Based on RDF, we propose an RDF-based outlier detection method which can efficiently prune the data points which are deep in clusters, and detect outliers only within the remaining small subset of the data; c) The performance of our method is further improved by means of a vertical data representation, P-trees. We tested our method with NHL and NBA data. Our method shows an order of magnitude speed improvement compared to the contemporary approaches.
Citation:
Dongmei Ren, Baoying Wang, William Perrizo, "RDF: A Density-Based Outlier Detection Method using Vertical Data Representation," icdm, pp.503-506, Fourth IEEE International Conference on Data Mining (ICDM'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.