Proceedings 16th Workshop on Parallel and Distributed Simulation (2002)
May 12, 2002 to May 15, 2002
Ranjit Noronha, State University of New York
Nael B. Abu-Ghazaleh, State University of New York
Parallel Discrete Event Simulation (PDES) on a cluster of workstations is a fine-grained application in which communication performance can dictate the efficiency of the simulation. The high-performance Local/System Area Networks used in high-end clusters are capable of delivering data with high bandwidth and low latency. Unfortunately, the communication rate far outpaces the capabilities of workstation nodes to handle it (I/O bus, memory bus, CPU resources). For this reason, many vendors are offering a programmable processor on the NIC to allow application-specific optimization of the communication path. This invites a new implementation model for distributed applications where: (i) application-specific communication optimizations can be implemented on the NIC; (ii) portions of the application that are most heavily communicating can be migrated to the NIC; (iii) some messages can be filtered out at the NIC without burdening the primary processor resources; and (iv) critical events are detected and handled early. The aim of our research is to investigate the utility of this model for PDES and to gain initial experience with the implementation challenges and the potential performance improvement. In this paper, we present our experiences with Early Cancellation, an optimization for Time-Warp that cancels messages in place upon early discovery of a rollback. We believe that there is a large scope for additional optimizations using this model.
Clusters, Global Virtual Time, Programmable NIC, Time Warp Event Simulation
R. Noronha and N. B. Abu-Ghazaleh, "Early Cancellation: An Active NIC Optimization for Time-Warp," Proceedings 16th Workshop on Parallel and Distributed Simulation (PADS), Washington, D.C., 2002, pp. 43.