|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2011 Sixth International Conference on Availability, Reliability and Security
Proactive Failure Management by Integrated Unsupervised and Semi-Supervised Learning for Dependable Cloud Systems
Vienna, Austria
August 22-August 26
ISBN: 978-0-7695-4485-4
| ASCII Text | x | ||
| Qiang Guan, Ziming Zhang, Song Fu, "Proactive Failure Management by Integrated Unsupervised and Semi-Supervised Learning for Dependable Cloud Systems," 2012 Seventh International Conference on Availability, Reliability and Security, pp. 83-90, 2011 Sixth International Conference on Availability, Reliability and Security, 2011. | |||
| BibTex | x | ||
| @article{ 10.1109/ARES.2011.20, author = {Qiang Guan and Ziming Zhang and Song Fu}, title = {Proactive Failure Management by Integrated Unsupervised and Semi-Supervised Learning for Dependable Cloud Systems}, journal ={2012 Seventh International Conference on Availability, Reliability and Security}, volume = {0}, year = {2011}, isbn = {978-0-7695-4485-4}, pages = {83-90}, doi = {http://doi.ieeecomputersociety.org/10.1109/ARES.2011.20}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - 2012 Seventh International Conference on Availability, Reliability and Security TI - Proactive Failure Management by Integrated Unsupervised and Semi-Supervised Learning for Dependable Cloud Systems SN - 978-0-7695-4485-4 SP83 EP90 A1 - Qiang Guan, A1 - Ziming Zhang, A1 - Song Fu, PY - 2011 KW - Cloud systems KW - Dependable systems KW - Learning algorithms KW - Bayesian detector KW - Decision tree VL - 0 JA - 2012 Seventh International Conference on Availability, Reliability and Security ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ARES.2011.20
Cloud computing systems continue to grow in their scale and complexity. They are changing dynamically as well due to the addition and removal of system components, changing execution environments, frequent updates and upgrades, online repairs and more. In such large-scale complex and dynamic systems, failures are common. In this paper, we present a failure prediction mechanism exploiting both unsupervised and semi-supervised learning techniques for building dependable cloud computing systems. The unsupervised failure detection method uses an ensemble of Bayesian models. It characterizes normal execution states of the system and detects anomalous behaviors. After the anomalies are verified by system administrators, labeled data are available. Then, we apply supervised learning based on decision tree classier to predict future failure occurrences in the cloud. Experimental results in an institute-wide cloud computing system show that our proposed method can forecast failure dynamics with high accuracy.
Index Terms:
Cloud systems, Dependable systems, Learning algorithms, Bayesian detector, Decision tree
Citation:
Qiang Guan, Ziming Zhang, Song Fu, "Proactive Failure Management by Integrated Unsupervised and Semi-Supervised Learning for Dependable Cloud Systems," ares, pp.83-90, 2011 Sixth International Conference on Availability, Reliability and Security, 2011
Usage of this product signifies your acceptance of the Terms of Use.
