Issue No. 12 - December (2011 vol. 22)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPDS.2011.86
Chen Yu , Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Cheng-Zhong Xu , Dept. of Electr. & Comput. Eng., Wayne State Univ., Detroit, MI, USA
Xiaofei Liao , Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Hai Jin , Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Haikun Liu , Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Live migration of virtual machines (VM) across physical hosts provides a significant new benefit for administrators of data centers and clusters. Previous memory-to-memory approaches demonstrate the effectiveness of live VM migration in local area networks (LAN), but they would cause a long period of downtime in a wide area network (WAN) environment. This paper describes the design and implementation of a novel approach, namely, CR/TR-Motion, which adopts checkpointing/recovery and trace/replay technologies to provide fast, transparent VM migration for both LAN and WAN environments. With execution trace logged on the source host, a synchronization algorithm is performed to orchestrate the running source and target VMs until they reach a consistent state. CR/TR-Motion can greatly reduce the migration downtime and network bandwidth consumption. Experimental results show that the approach can drastically reduce migration overheads compared with memory-to-memory approach in a LAN: up to 72.4 percent on application observed downtime, up to 31.5 percent on total migration time, and up to 95.9 percent on the data to synchronize the VM state. The application performance overhead due to migration is kept within 8.54 percent on average. The results also show that for a variety of workloads migrated across WANs, the migration downtime is less than 300 milliseconds.
wide area networks, checkpointing, computer centres, local area networks, virtual machines, network bandwidth consumption reduction, live virtual machine migration, asynchronous replication, state synchronization, data centers, data clusters, local area networks, wide area network, CR/TR-Motion, checkpointing technology, recovery technology, trace technology, replay technology, migration downtime reduction, Synchronization, Algorithm design and analysis, Wide area networks, Local area networks, Virtual machining, Checkpointing, virtual machine., Copy-on-write checkpointing, deterministic replay, live migration, trace
Chen Yu, Cheng-Zhong Xu, Xiaofei Liao, Hai Jin and Haikun Liu, "Live Virtual Machine Migration via Asynchronous Replication and State Synchronization," in IEEE Transactions on Parallel & Distributed Systems, vol. 22, no. , pp. 1986-1999, 2011.