Dec. 13, 2010 to Dec. 15, 2010
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/PRDC.2010.12
Dependencies between failures in operational networks may have a huge impact on their reliability and availability. In this paper we analyze failure logs to identify simultaneous and potentially correlated failures in routers and links of an IP backbone network. We show that the actual behavior of failure processes does not support the independence assumption commonly used in theoretical studies. Scatter plots are presented to visualize the failure processes, and it is seen that geographical adjacency has a pronounced effect. The existence of high correlation coefficients and high autocorrelation in some failure processes was observed. A formal analysis confirms this. The consequences of these dependencies on the provisioning of guaranteed availability are briefly discussed.
Bjarne E. Helvik, Jon K. Hellan, Andres J. Gonzalez, "Analysis of Dependencies between Failures in the UNINETT IP Backbone Network", PRDC, 2010, Pacific Rim International Symposium on Dependable Computing, IEEE, Pacific Rim International Symposium on Dependable Computing, IEEE 2010, pp. 149-156, doi:10.1109/PRDC.2010.12