Parallel Architectures, Algorithms and Programming, International Symposium on (2010)
Dalian, Liaoning China
Dec. 18, 2010 to Dec. 20, 2010
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/PAAP.2010.26
Performance problems, which can stem from different system components, such as network, memory, and storage devices, are difficult to diagnose and isolate in a cluster file system. In this paper, we present an online performance anomaly detector which is able to efficiently detect performance anomaly and accurately identify the faulty sources in a system node of a cluster file system. Our method exploits the stable relationship between workloads and system resource statistics to detect the performance anomaly and identify faulty sources which cause the performance anomaly in the system. Our preliminary experimental results demonstrate the efficiency and accuracy of the proposed performance anomaly detector.
performance anomaly detector, cluster file system
He Guo, Xin Chen, Xubin He, Yuxin Wang, "An Online Performance Anomaly Detector in Cluster File Systems", Parallel Architectures, Algorithms and Programming, International Symposium on, vol. 00, no. , pp. 191-198, 2010, doi:10.1109/PAAP.2010.26