|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2010 IEEE International Conference on Data Mining
Data Editing Techniques to Allow the Application of Distance-Based Outlier Detection to Streams
Sydney, Australia
December 13-December 17
ISBN: 978-0-7695-4256-0
| ASCII Text | x | ||
| Vit Niennattrakul, Eamonn Keogh, Chotirat Ann Ratanamahatana, "Data Editing Techniques to Allow the Application of Distance-Based Outlier Detection to Streams," Data Mining, IEEE International Conference on, pp. 947-952, 2010 IEEE International Conference on Data Mining, 2010. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDM.2010.56, author = {Vit Niennattrakul and Eamonn Keogh and Chotirat Ann Ratanamahatana}, title = {Data Editing Techniques to Allow the Application of Distance-Based Outlier Detection to Streams}, journal ={Data Mining, IEEE International Conference on}, volume = {0}, year = {2010}, issn = {1550-4786}, pages = {947-952}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDM.2010.56}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Mining, IEEE International Conference on TI - Data Editing Techniques to Allow the Application of Distance-Based Outlier Detection to Streams SN - 1550-4786 SP947 EP952 A1 - Vit Niennattrakul, A1 - Eamonn Keogh, A1 - Chotirat Ann Ratanamahatana, PY - 2010 KW - Data editing KW - Anomaly detection KW - Data stream VL - 0 JA - Data Mining, IEEE International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2010.56
The problem of finding outliers in data has broad applications in areas as diverse as data cleaning, fraud detection, network monitoring, invasive species monitoring, etc. While there are dozens of techniques that have been proposed to solve this problem for static data collections, very simple distance-based outlier detection methods are known to be competitive or superior to more complex methods. However, distance-based methods have time and space complexities that make them impractical for streaming data and/or resource limited sensors. In this work, we show that simple data-editing techniques can make distance-based outlier detection practical for very fast streams and resource limited sensors. Our technique generalizes to produce two algorithms, which, relative to the original algorithm, can guarantee to produce no false positives, or guarantee to produce no false negatives. Our methods are independent of both data type and distance measure, and are thus broadly applicable.
Index Terms:
Data editing, Anomaly detection, Data stream
Citation:
Vit Niennattrakul, Eamonn Keogh, Chotirat Ann Ratanamahatana, "Data Editing Techniques to Allow the Application of Distance-Based Outlier Detection to Streams," icdm, pp.947-952, 2010 IEEE International Conference on Data Mining, 2010
Usage of this product signifies your acceptance of the Terms of Use.
