2009 IEEE International Conference on Data Engineering
Online Anomaly Prediction for Robust Cluster Systems
March 29-April 02
ISBN: 978-0-7695-3545-6
Authors: Xiaohui Gu, Haixun Wang
Pages: 1000-1011
ISSN: 1084-4627
Publisher: IEEE Computer Society, Los Alamitos, CA, USA
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDE.2009.128
In this paper, we present a stream-based mining algorithm for online anomaly prediction. Many real-world applications, such as data stream analysis, require continuous cluster operation. Unfortunately, today's large-scale cluster systems are still vulnerable to various software and hardware problems. System administrators are often overwhelmed by the task of correcting system anomalies such as processing bottlenecks (i.e., full stream buffers), resource hot spots, and service level objective (SLO) violations. Our anomaly prediction scheme raises early alerts for impending system anomalies and suggests possible anomaly causes. Specifically, we employ Bayesian classification methods to capture different anomaly symptoms and infer anomaly causes. Markov models are introduced to capture the changing patterns of different measurement metrics. More importantly, our scheme combines Markov models and Bayesian classification methods to predict when a system anomaly will appear in the foreseeable future and what its possible causes are. To the best of our knowledge, our work provides the first stream-based mining algorithm for predicting system anomalies. We have implemented our approach within the IBM System S distributed stream processing cluster and conducted case study experiments using fully implemented distributed data analysis applications processing real application workloads. Our experiments show that our approach efficiently predicts and diagnoses several bottleneck anomalies with high accuracy while imposing low overhead on the cluster system.
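The abstract describes the combination of Markov models and Bayesian classification only at a high level. The following is a minimal Python sketch of one way such a combination could work, not the authors' implementation. It assumes each metric has already been discretized into a small number of bins, uses a first-order Markov chain per metric to project the bin distribution a few steps ahead, and feeds those projected distributions into a naive Bayes classifier. All function names, the two example metrics (buffer usage and CPU load), and the synthetic trace are hypothetical.

```python
def fit_markov(states, n_states, alpha=1.0):
    """Estimate a row-stochastic transition matrix with Laplace smoothing."""
    counts = [[alpha] * n_states for _ in range(n_states)]
    for a, b in zip(states, states[1:]):
        counts[a][b] += 1
    return [[c / sum(row) for c in row] for row in counts]


def predict_distribution(trans, current, steps):
    """Distribution over bins `steps` transitions after bin `current`."""
    n = len(trans)
    dist = [1.0 if i == current else 0.0 for i in range(n)]
    for _ in range(steps):
        dist = [sum(dist[i] * trans[i][j] for i in range(n)) for j in range(n)]
    return dist


def fit_naive_bayes(rows, labels, n_states, alpha=1.0):
    """Learn class priors and per-metric bin likelihoods P(bin | class)."""
    n_metrics = len(rows[0])
    prior, like = {}, {}
    for c in sorted(set(labels)):
        class_rows = [r for r, y in zip(rows, labels) if y == c]
        prior[c] = len(class_rows) / len(rows)
        like[c] = []
        for m in range(n_metrics):
            counts = [alpha] * n_states
            for r in class_rows:
                counts[r[m]] += 1
            total = sum(counts)
            like[c].append([v / total for v in counts])
    return prior, like


def anomaly_probability(prior, like, dists, anomaly_label=1):
    """Posterior of the anomaly class given predicted per-metric distributions."""
    scores = {}
    for c in prior:
        score = prior[c]
        for m, dist in enumerate(dists):
            # Expected likelihood of metric m under the predicted distribution.
            score *= sum(p * like[c][m][s] for s, p in enumerate(dist))
        scores[c] = score
    return scores[anomaly_label] / sum(scores.values())


# Synthetic, hypothetical trace: bins 0 (low), 1 (medium), 2 (high).
N_BINS = 3
buffer_bins = [0, 0, 1, 1, 2, 2, 2, 0, 0, 1, 2, 2, 0, 0, 0, 1, 1, 2, 2, 2]
cpu_bins = [0, 1, 1, 1, 2, 2, 1, 0, 0, 1, 2, 2, 0, 0, 1, 1, 1, 2, 2, 2]
# Assumption for illustration: a full buffer (bin 2) is labeled as an anomaly.
labels = [1 if b == 2 else 0 for b in buffer_bins]

chains = [fit_markov(buffer_bins, N_BINS), fit_markov(cpu_bins, N_BINS)]
prior, like = fit_naive_bayes(list(zip(buffer_bins, cpu_bins)), labels, N_BINS)


def predict_anomaly(current_bins, steps):
    """Anomaly probability `steps` ahead, given the current bin of each metric."""
    dists = [predict_distribution(t, s, steps) for t, s in zip(chains, current_bins)]
    return anomaly_probability(prior, like, dists)


p_high = predict_anomaly((2, 2), steps=1)  # both metrics currently high
p_low = predict_anomaly((0, 0), steps=1)   # both metrics currently low
print(f"P(anomaly | high) = {p_high:.3f}, P(anomaly | low) = {p_low:.3f}")
```

In this sketch an early alert would be raised when the predicted probability crosses a chosen threshold, and the metric whose predicted distribution contributes most to the anomaly score would be a natural candidate cause. Both the threshold and that attribution rule are design choices the abstract does not specify.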
Citation:
Xiaohui Gu, Haixun Wang, "Online Anomaly Prediction for Robust Cluster Systems," 2009 IEEE International Conference on Data Engineering (ICDE), pp. 1000-1011, 2009
