
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
Rahul Garg, Vijay K. Garg, Yogish Sabharwal, "Efficient Algorithms for Global Snapshots in Large Distributed Systems," IEEE Transactions on Parallel and Distributed Systems, vol. 21, no. 5, pp. 620630, May, 2010.  
BibTex  x  
@article{ 10.1109/TPDS.2009.108, author = {Rahul Garg and Vijay K. Garg and Yogish Sabharwal}, title = {Efficient Algorithms for Global Snapshots in Large Distributed Systems}, journal ={IEEE Transactions on Parallel and Distributed Systems}, volume = {21}, number = {5}, issn = {10459219}, year = {2010}, pages = {620630}, doi = {http://doi.ieeecomputersociety.org/10.1109/TPDS.2009.108}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Parallel and Distributed Systems TI  Efficient Algorithms for Global Snapshots in Large Distributed Systems IS  5 SN  10459219 SP620 EP630 EPD  620630 A1  Rahul Garg, A1  Vijay K. Garg, A1  Yogish Sabharwal, PY  2010 KW  Checkpointing KW  global snapshots KW  stable predicates. VL  21 JA  IEEE Transactions on Parallel and Distributed Systems ER   
[1] F. Mattern, "Efficient Algorithms for Distributed Snapshots and Global Virtual Time Approximation," J. Parallel and Distributed Computing, pp. 423434, Aug. 1993.
[2] M. Schulz, G. Bronevetsky, R. Fernandes, D. Marques, K. Pingali, and P. Stodghill, "Implementation, Evaluation of a Scalable ApplicationLevel CheckpointRecovery Scheme for MPI Programs," Proc. Supercomputing Conf. (SC '04), Nov. 2004.
[3] T.H. Lai and T.H. Yang, "On Distributed Snapshots," Information Processing Letters, vol. 25, no. 3, pp. 153158, 1987.
[4] L. Lamport, "Time, Clocks, and the Ordering of Events in a Distributed System," Comm. ACM, vol. 21, no. 7, pp. 558565, July 1978.
[5] K.M. Chandy and L. Lamport, "Distributed Snapshots: Determining Global States of Distributed Systems," ACM Trans. Computer Systems, vol. 3, no. 1, pp. 6375, Feb. 1985.
[6] M. Spezialetti and P. Kearns, "Efficient Distributed Snapshots," Proc. Sixth Int'l Conf. Distributed Computing Systems, pp. 382388, 1986.
[7] L. Bouge, "Repeated Snapshots in Distributed Systems with Synchronous Communication and Their Implementation in CSP," Theoretical Computer Science, vol. 49, pp. 145169, 1987.
[8] A.D. Kshemkalyani, M. Raynal, and M. Singhal, "An Introduction to Snapshot Algorithms in Distributed Computing," Distributed Systems Eng., vol. 2, no. 4, pp. 224233, http://stacks.iop.org/09671846/2224. Dec. 1995.
[9] J.M. Hélary, A. Mostefaoui, R.H.B. Netzer, and M. Raynal, "CommunicationBased Prevention of Useless Checkpoints in Distributed Computations," Distributed Computing, vol. 13, no. 1, pp. 2943, 2000.
[10] Q. Jiang, Y. Luo, and D. Manivannan, "An Optimistic Checkpointing and Message Logging Approach for Consistent Global Checkpoint Collection in Distributed Systems," J. Parallel and Distributed Computing, vol. 68, no. 12, pp. 15751589, 2008.
[11] A. Kshemkalyani and B. Wu, "Detecting Arbitrary Stable Properties Using Efficient Snapshots," IEEE Trans. Software Eng., vol. 5, no. 33, pp. 330346, May 2007.
[12] V.K. Garg, Concurrent and Distributed Computing in Java. Wiley & Sons, 2004.
[13] E.W. Dijkstra and C.S. Scholten, "Termination Detection for Diffusing Computations," Information Processing Letters, vol. 11, no. 4, pp. 14, Aug. 1980.
[14] K.M. Chandy and J. Misra, "How Processes Learn," Proc. Fourth Ann. ACM Symp. Principles of Distributed Computing, R. Strong, ed., pp. 204214, Aug. 1985.
[15] B. Awerbuch, "Complexity of Network Synchronization," J. ACM, vol. 32, no. 4, pp. 804823, Oct. 1985.
[16] R. Garg, V.K. Garg, and Y. Sabharwal, "Scalable Algorithms for Global Snapshots in Distributed Systems," Proc. 20th Ann. Int'l Conf. Supercomputing (ICS '06), pp. 269277, 2006.