Modeling and Analysis of Communication Networks in Multicluster Systems under Spatio-Temporal Bursty Traffic
Issue No.05 - May (2012 vol.23)
Geyong Min , University of Bradford, Bradford
Keqiu Li , Dalian University of Technology, Dalian
Yulei Wu , University of Bradford, Bradford
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPDS.2011.198
Multicluster systems have emerged as a promising infrastructure for provisioning of cost-effective high-performance computing and communications. Analytical models of communication networks in cluster systems have been widely reported. However, for tractability and simplicity, the existing models are based on the assumptions that the network traffic follows the nonbursty Poisson arrival process and the message destinations are uniformly distributed. Recent measurement studies have shown that the traffic generated by real-world applications reveals the bursty nature in both the spatial domain (i.e., nonuniform distribution of message destinations) and temporal domain (i.e., bursty message arrival process). In order to obtain a comprehensive understanding of the system performance, a novel analytical model is developed for communication networks in multicluster systems in the presence of the spatio-temporal bursty traffic. The spatial traffic burstiness is captured by the communication locality and the temporal traffic burstiness is modeled by the Markov-modulated Poisson process. After validating its accuracy through extensive simulation experiments, the model is used to investigate the impact of bursty message arrivals and communication locality on network performance. The analytical results demonstrate that the communication locality can relieve the degrading effects of bursty message arrivals on the network performance.
Cluster computing, bursty traffic, communication locality, fat tree, performance modeling.
Geyong Min, Keqiu Li, Yulei Wu, "Modeling and Analysis of Communication Networks in Multicluster Systems under Spatio-Temporal Bursty Traffic", IEEE Transactions on Parallel & Distributed Systems, vol.23, no. 5, pp. 902-912, May 2012, doi:10.1109/TPDS.2011.198