Fifth International Conference on Grid and Cooperative Computing Workshops
Large-Scale DNA Sequence Assembly by Using Computing Grid
Hunan, China
October 21-October 23
ISBN: 0-7695-2695-0
Zhigang Luo, National University of Defense Technology, China
Fan Ding, National University of Defense Technology, China
DNA sequence assembly is a fundamental part of biological computing. However, most of the largescale sequence assemblies require intensive computing power and huge storage. To speed up the assembly process, we here propose a method for large-scale DNA sequence assembly by using computing grid. The central idea of our method is to first cluster the input of fragment set into many non-intersected subsets using k-mers and then to distribute them to all nodes of the grid-computing system. Our method has accuracy of more than 92% on the test data sets under the simulated grid-computing system but costing shorter time and lower storage. Our method can efficiently process large-scale DNA sequence assembly by taking advantage of huge storage and computing capacity of computing gird.
Citation:
Xiaoyong Fang, Zhigang Luo, Zhenghua Wang, Fan Ding, "Large-Scale DNA Sequence Assembly by Using Computing Grid," gccw, pp.397-400, Fifth International Conference on Grid and Cooperative Computing Workshops, 2006