This Article 
 Bibliographic References 
 Add to: 
Communication State Transfer for the Mobility of Concurrent Heterogeneous Computing
October 2004 (vol. 53 no. 10)
pp. 1260-1273
In a dynamic environment where a process can migrate from one host to another host, communication state transfer is a key issue of process coordination. This paper presents a set of data communication and process migration protocols to support communication state transfer in a dynamic, distributed parallel environment. The protocols preserve the semantics of point-to-point communication; they guarantee message delivery, maintain message ordering, and do not introduce deadlock when blocking send or receive operations are performed during process migration. Analytical proofs and prototype implementation are conducted to confirm the correctness of the protocols. Analytical and experimental results show the proposed design is valid and has a true potential in network computing.

[1] D.S. Milojicic, F. Douglis, Y. Paindaveine, R. Wheeler, and S. Zhou, Process Migration technical report, TOG Research Inst., Dec. 1996.
[2] P. Smith and N.C. Hutchinson, Heterogeneous Process Migration: The Tui System Software Practice and Experience, vol. 28, no. 6, pp. 611-639, 1998.
[3] X.-H. Sun, V.K. Niak, and K. Chanchio, A Coordinated Approach for Process Migration in Heterogeneous Environments Proc. 1999 SIAM Parallel Processing Conf., Mar. 1999.
[4] S. Leutenegger and X.-H. Sun, Limitations of Cycle Stealing of Parallel Processing on a Network Of Homogeneous Workstations J. Parallel and Distributed Computing, vol. 43, no. 3, 1997.
[5] M. Harchol-Balter and A. Downey, Exploiting Process Lifetime Distribution for Dynamic Load Balancing ACM Trans. Computer Systems, vol. 15, 1997.
[6] P. Krueger and M. Livny, A Comparison of Preemtive and Non-Preemtive Load Balancing Proc. Eighth Int'l Conf. Distributed Computing Systems, pp. 336-343, 1988.
[7] L. Gong, X.-H. Sun, and E. Watson, Performance Modeling and Prediction of Non-Dedicated Network Computing IEEE Trans. Computers, vol. 51, no. 9, pp. 1041-1050, Sept. 2003.
[8] I. Foster and C. Kesselman, Globus: A Metacomputing Infrastructure Toolkit Int'l J. Supercomputer Applications, vol. 11, no. 2, pp. 115-128, 1997.
[9] HPCM: High Performance Computing Mobility http://meta. cs.iit.eduhpcm/, 2004.
[10] K. Chanchio and X.-H. Sun, Data Collection and Restoration for Heterogeneous Process Migration Software-Practice and Experience, vol. 32, Apr. 2002.
[11] G. Stellner, Consistent Checkpoints of PVM Applications Proc. First European PVM Users Group Meeting, 1994.
[12] R.A. Finkel, M.L. Scott, Y. Artsy, and H.-Y. Chang, Experience with Charlotte: Simplicity and Function in a Distributed Operating System IEEE Trans. Software Eng., vol. 15, no. 6, pp. 676-685, June 1989.
[13] E. Roush, The Freeze Free Algorithm for Process Migration PhD thesis, Computer Science Dept., Univ. of Illinois at Urbana-Champaign, May 1995.
[14] J. Casas, D. Clark, R. Konuru, S. Otto, R. Prouty, and J. Walpole, MPVM: A Migratable Transparent Version of PVM Computing Systems, vol. 8, no. 2, pp. 171-216, 1995.
[15] A. Geist, A. Beguelin, J. Dongarra, W. Jiang, R. Manchek, and V. Sunderam, PVM: Parallel Virtual Machine A Users' Guide and Tutorial for Networked Parallel Computing. MIT Press, 1994.
[16] G. Burns et al., LAM: An Open Cluster Environment for MPI Proc.f Supercomputing Symp. 1994, pp. 379-386, 1994.
[17] J. Squyres, A. Lumsdaine, W. George, J. Hagedorn, and J. Devaney, The Interoperable Message Passing Interface IMPI Extensions to LAM/MPI Proc. MPI Developer's Conf., 2000.
[18] I. Foster and C. Kesselman, The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, 1998.
[19] I. Stoica, R. Morris, D. Karger, F. Kaashoek, and H. Balakrishnan, Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications IEEE/ACM Trans. Networking, vol. 11, Feb. 2003.
[20] M.H. Theimer and B. Hayes, Heterogeneous Process Migration by Recompilation Proc. 11th IEEE Int'l Conf. Distributed Computing Systems, pp. 18-25, June 1991.
[21] D. von Bank, C.M. Shub, and R.W. Sebesta, A Unified Model of Pointwise Equivalence of Procedural Computations ACM Trans. Programming Languages and Systems, vol. 16, Nov. 1994.
[22] K.M. Chandy and L. Lamport, Distributed Snapshots: Determining Global States of Distributed System ACM Trans. Computer Systems, pp. 63-75, 1987.
[23] L. Lamport, Time, Clocks and the Ordering of Events in a Distributed System Comm. ACM, vol. 21, no. 7, pp. 558-565, 1978.
[24] S. White, A. Alund, and V.S. Sunderam, Performance of the NAS Parallel Benchmarks on PVM Based Networks Technical Report RNR-94-008, Dept. of Math. and Computer Science, Emory Univ., May 1994.
[25] P. Dan, W. Dongsheng, Z. Youhui, and S. Meiming, Quasi-Asynchronous Migration: A Novel Migration Proticol for PVM Tasks Operating Systems Rev., vol. 33, no. 2, pp. 5-14, 1999.
[26] G.D. van Albada, J. Clinckemaillie, A.H.L. Emmen, J. Gehring, O. Heinz, F. van der Linden, B.J. Overeinder, A. Reinefeld, and P.M.A. Sloot, Dynamite Blasting Obstacles to Parallel Cluster Computing Proc. Fifth Ann. Conf. Advanced School for Computing and Imaging (ASCI), M. Boasson, J.A. Kaandorp, J.F.M. Tonino, and M.G. Vosselman, eds., pp. 31-37, June 1999.
[27] K. Thitikamol and P. Keleher, Thread Migration and Communication Minimization in DSM Systems Proc. IEEE, vol. 87, special issue on distributed shared memory systems, pp. 487-497, Mar. 1999.
[28] C.D. Carothers and B.K. Szymanski, Linux Support for Transparent Checkpointing of Multithreaded Programs Dr. Dobb's J., Aug. 2002.
[29] M.V. Nibhanapudi and B.K. Szymanski, Runtime Support for Virtual BSP Computer Proc. 12th Int'l Parallel Processing Symp. (IPPS/SPDP), pp. 147-158, 1998.
[30] A.J. Ferrari, S.J. Chapin, and A.S. Grimshaw, Process Introspection: A Heterogeneous Checkpoint/Restart Mechanism Based on Automatic Code Modification Technical Report CS-97-05, Dept. of Computer Science, Univ. of Virginia, Mar. 1997.
[31] W. Milojicic, W. Zint, and A. Dangel, Task Migration on Top of the Mach Microkernel Design and Implementation Proc. Eighth Int'l Conf. Distributed Computing Systems, pp. 336-343, 1993.
[32] C. Tan, W. Wong, and C. Yuen, tmPVM Task Migratable PVM Proc. 10th Int'l Parallel Processing Symp. (IPPS/SPDP), 1996.
[33] C. Du, X.-H. Sun, and K. Chanchio, HPCM: A Pre-Compiler Aided Middleware for the Mobility of Legacy Code Proc. IEEE Cluster Computing Conf., Dec. 2003.

Index Terms:
Communication protocol, process migration, distributed and parallel processing, point-to-point communication.
Kasidit Chanchio, Xian-He Sun, "Communication State Transfer for the Mobility of Concurrent Heterogeneous Computing," IEEE Transactions on Computers, vol. 53, no. 10, pp. 1260-1273, Oct. 2004, doi:10.1109/TC.2004.73
Usage of this product signifies your acceptance of the Terms of Use.