Cluster Computing and the Grid, IEEE International Symposium on (2006)
May 16, 2006 to May 19, 2006
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CCGRID.2006.5
Pallab Datta , Los Alamos National Laboratory, USA
Sushant Sharma , Los Alamos National Laboratory, USA
Wu-Chun Feng , Virginia Tech, USA
Next-generation e-Science applications will require the ability to transfer information at high data rates between distributed computing centers and data repositories. A Lambda- Grid offers dedicated, optical, circuit-switched, point-topoint connections, which may be reserved exclusively for an application. Though such dedicated high-speed connections eliminate congestion in the network, they effectively push the network congestion out to the end systems, as processing speeds have not kept up with networking speeds. Therefore, developing an efficient transport protocol over such highspeed dedicated circuits is of critical importance. <p>In this work, we propose the idea of a lightweight endsystem protocol, based on performance monitoring, to significantly improve the performance of data transport over a LambdaGrid. In particular, we focus on dynamically monitoring the OS task scheduling at the receiving end-system so that potential end-system congestion may be detected early and appropriate feedback can be transmitted back to the sending end-system to avoid packet losses. One example of such an evasive action is to suspend transmission for a certain duration of time during which the OS on the receiving end-system must handle other computational processes. With this in mind, we propose to extend the Reliable-Blast UDP (RBUDP) protocol to take such evasive action by using a simple feedback mechanism that is activated via performance monitoring. The new protocol, named RBUDP+ dramatically improves the performance of data transfer over LambdaGrids. We demonstrate the effectiveness of our proposed protocol and illustrate the performance gains achieved via network emulation</p>
W. Feng, S. Sharma and P. Datta, "A Feedback Mechanism for Network Scheduling in LambdaGrids," Cluster Computing and the Grid, IEEE International Symposium on(CCGRID), Singapore, 2006, pp. 584-591.