2006 IEEE International Conference on Cluster Computing (2006)
Sept. 25, 2006 to Sept. 28, 2006
R.A. Oldfield , Sandia Nat. Labs., Albuquerque, NM
L. Ward , Sandia Nat. Labs., Albuquerque, NM
R. Riesen , Sandia Nat. Labs., Albuquerque, NM
Today's high-end massively parallel processing (MPP) machines have thousands to tens of thousands of processors, with next-generation systems planned to have in excess of one hundred thousand processors. For systems of such scale, efficient I/O is a significant challenge that cannot be solved using traditional approaches. In particular, general purpose parallel file systems that limit applications to standard interfaces and access policies do not scale and will likely be a performance bottleneck for many scientific applications. In this paper, we investigate the use of a "lightweight" approach to I/O that requires the application or I/O-library developer to extend a core set of critical I/O functionality with the minimum set of features and services required by its target applications. We argue that this approach allows the development of I/O libraries that are both scalable and secure. We support our claims with preliminary results for a lightweight checkpoint operation on a development cluster at Sandia
lightweight checkpoint operation, lightweight I/O, scientific applications, high-end massively parallel processing machines, parallel file systems, I/O-library developer, critical I/O functionality
A. Maccabe, P. Widener, R. Oldfield, L. Ward, T. Kordenbrock and R. Riesen, "Lightweight I/O for Scientific Applications," 2006 IEEE International Conference on Cluster Computing(CLUSTER), Barcelona, 2006, pp. 1-11.