International Conference on Parallel Computing in Electrical Engineering (PARELEC'04)
Towards Easy-to-Use Checkpointing of MPI Applications within CLUSTERIX
Dresden, Germany
September 7-10, 2004
ISBN: 0-7695-2080-4
Pawel Czarnul, Gdansk University of Technology, Poland
Arkadiusz Urbaniak, Gdansk University of Technology, Poland
Marcin Fraczak, Gdansk University of Technology, Poland
Maciej Dyczkowski, Wroclaw University of Technology, Poland
Bartlomiej Balcerek, Wroclaw University of Technology, Poland
Many kernel-level and user-level libraries and systems support checkpointing running processes and resuming their operation, yet it remains very difficult to provide an easy-to-use tool for checkpointing parallel applications. In this work, we aim to develop an easy-to-use, user-guided library that supports checkpointing parallel MPI applications executed within the CLUSTERIX environment, i.e., a collection of distributed HPC clusters. We propose a programmer-assisted approach in which process state is packed and unpacked at the code level for SPMD HPC applications. Although the library is at an early stage of development, we present checkpoint/restart times and application execution times (with execution interrupted by checkpointing) for the proposed approach, compared to the same application linked with the ckpt user-level library.
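The programmer-assisted idea in the abstract, packing and unpacking process state at the code level, can be illustrated with a minimal sketch. This is not the authors' library or its API: the function names (`pack_state`, `unpack_state`, `save_checkpoint`, `load_checkpoint`, `run`), the state layout, and the file name are all hypothetical, and the example runs as a single process rather than under MPI. In an SPMD program, each rank would presumably write its own checkpoint file in the same way.

```python
import os
import struct
import tempfile

# Hypothetical state layout: iteration counter (int64) and partial sum (float64).
STATE_FORMAT = "<qd"

def pack_state(iteration, partial_sum):
    """Pack only the variables needed to resume into a byte string."""
    return struct.pack(STATE_FORMAT, iteration, partial_sum)

def unpack_state(blob):
    """Inverse of pack_state: recover (iteration, partial_sum)."""
    return struct.unpack(STATE_FORMAT, blob)

def save_checkpoint(path, blob):
    """Write to a temporary file, then rename it over the old checkpoint,
    so an interruption during the write does not corrupt the last good one."""
    tmp = path + ".tmp"
    with open(tmp, "wb") as f:
        f.write(blob)
        f.flush()
        os.fsync(f.fileno())
    os.replace(tmp, path)

def load_checkpoint(path):
    """Return the stored bytes, or None if no checkpoint exists yet."""
    if not os.path.exists(path):
        return None
    with open(path, "rb") as f:
        return f.read()

def run(path, total_steps, checkpoint_every, stop_after=None):
    """Sum 0..total_steps-1, resuming from the checkpoint at `path` if present.

    `stop_after` simulates an interruption after that many steps of this run.
    """
    blob = load_checkpoint(path)
    iteration, partial_sum = unpack_state(blob) if blob else (0, 0.0)
    steps_done = 0
    while iteration < total_steps:
        if stop_after is not None and steps_done >= stop_after:
            return None  # simulated interruption; work since last checkpoint is lost
        partial_sum += iteration
        iteration += 1
        steps_done += 1
        if iteration % checkpoint_every == 0:
            save_checkpoint(path, pack_state(iteration, partial_sum))
    return partial_sum

if __name__ == "__main__":
    ckpt = os.path.join(tempfile.mkdtemp(), "rank0.ckpt")  # hypothetical file name
    run(ckpt, total_steps=100, checkpoint_every=10, stop_after=35)  # interrupted run
    print(run(ckpt, total_steps=100, checkpoint_every=10))          # resumed run
```

Because the programmer chooses which variables to pack, the checkpoint here is a few bytes rather than a full process image. That is the general trade-off of code-level checkpointing against transparent user-level libraries such as ckpt: smaller checkpoints in exchange for changes to the application code.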
Index Terms:
Process Checkpointing, Checkpointing Parallel Applications, Parallel Software Environments
Citation:
Pawel Czarnul, Arkadiusz Urbaniak, Marcin Fraczak, Maciej Dyczkowski, Bartlomiej Balcerek, "Towards Easy-to-Use Checkpointing of MPI Applications within CLUSTERIX," International Conference on Parallel Computing in Electrical Engineering (PARELEC'04), pp. 390-393, 2004