This Article 
 Bibliographic References 
 Add to: 
QoS Driven Parallelization of Resources to Reduce File Download Delay
October 2006 (vol. 17 no. 10)
pp. 1204-1215

Abstract—In this paper, we propose a novel approach for reducing the download time of large files over the Internet. Our approach, known as Parallelized File Transport Protocol (P-FTP), proposes simultaneous downloads of disjoint file portions from multiple file servers. P-FTP server selects file servers for the requesting client on the basis of a variety of QoS parameters, such as available bandwidth and server utilization. The sensitivity analysis of our file server selection technique shows that it performs significantly better than random selection. During the file transfer, P-FTP client monitors the file transfer flows to detect slow servers and congested links and adjusts the file distributions accordingly. P-FTP is evaluated with simulations and real-world implementation. The results show at least 50 percent reduction in download time when compared to the traditional file-transfer approach. Moreover, we have also carried out a simulation-based study to investigate the issues related to large scale deployment of our approach on the Internet. Our results demonstrate that a large number of P-FTP users has no adverse effect on the performance perceived by non-P-FTP users. In addition, the file servers and network are not significantly affected by large scale deployment of P-FTP.

[1] S. Sohail, S. Kanhere, S. Jha, A. Baig, and M. Malik, “A Parallelized File-Transfer-Protocol for On-Board IP Networks,” Proc. Vehicular Technology Conf., Spring 2005.
[2] G. Pultar, “Automatically Selecting a Close Mirror Based on Network Topology,” Proc. 12th Systems Administration Conf., pp. 159-165, 1998.
[3] J. Guyton and M. Schwartz, “Locating Nearby Copies of Replicated Internet Servers,” Proc. SIGCOMM, pp. 288-298, 1995.
[4] R. Carter and M. Crovella, “Server Selection Using Dynamic Path Characterization in Wide Area Networks,” Proc. INFOCOM, pp. 1014-1021, 1997.
[5] Z. Fu and N. Venkatasubramanian, “Directory Based Composite Routing and Scheduling Policies for Dynamic Multimedia Environment,” Proc. 15th Int'l Parallel and Distributed Processing Symp., p. 11, 2001.
[6] J. Byers et al., “Informed Content Delivery across Adaptive Overlay Networks,” IEEE/ACM Trans. Networking, vol 12, no. 5, pp. 767-780, 2004.
[7] M. Crovella and A. Bestavros, “Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes,” IEEE/ACM Trans. Networking, vol. 5, no. 6, pp. 835-846, 1997.
[8] P. Rodriguez and E. Biersack, “Dynamic Parallel Access to Replicated Content in the Internet,” IEEE/ACM Trans. Networking, vol. 10, no. 4, pp. 455-465, Aug. 2002.
[9] S. Sohail, S. Jha, and H. Elgindy, “Parallelized File Transfer Protocol,” Proc. IEEE Local Computer Networks Workshop High-Speed Local Networks, pp. 624-631, 2003.
[10] S. Sohail and S. Jha, “QoS Driven Parallelized File-Transfer-Protocol,” Proc. Australian Telecomm., Networks, and Application Conf., 2004.
[11] S. Sohail, C. Chou, S. Kanhere, and S. Jha, “On Large Scale Deployment of Parallelized File Transfer Protocol,” Proc. IEEE Int'l Performance Computing and Comm. Conf. (IPCCC), pp. 225-232, 2005.
[12] J. Doyle, Routing TCP/IP Volume I (CCIE Professional Development). Cisco Press, 1998.
[13] D. Katabi, I. Bazzi, and X. Yang, “A Passive Approach for Detecting Shared Bottlenecks,” Proc. IEEE Int'l Conf. Computer Comm. and Networks, pp. 174-181, 2001.
[14] S. Savage, “Sting: A TCP-Based Network Measurement Tool,” Proc. USENIX Symp. Internet Technologies and Systems, pp. 71-79, 1999.
[15] P. Francis et al., “IDMAPS: A Global Internet Host Distance Estimation Service,” IEEE/ACM Trans. Networking, vol. 9, pp. 524-540, 2001.
[16] K. Gummadi et al., “King Estimating Latency between Arbitrary Internet End Hosts,” Proc. SIGCOMM Internet Measurement Workshop, pp. 5-18, 2002.
[17] E. Ng and H. Zhang, “Predicting Internet Network Distance with Coordinates-Based Approaches,” Proc. IEEE INFOCOM, pp. 170-179, 2002.
[18] J. Postel and J. Reynolds, “File Transfer Protocol (FTP),” IETF RFC 959, 1985.
[19] J. Buyers et al., “Accessing Multiple Mirror Sites in Parallel: Using Tornado Code to Speed Up Downloads,” Proc. IEEE INFOCOM, pp. 275-283, 1999.
[20] S. Koo, C. Rosenberg, and D. Xu, “Analysis of Parallel Downloading for Large File Distribution,” Proc. IEEE Int'l Workshop Future Trends in Distributed Computing Systems, pp. 128-138, 2003.
[21] C. Gkantsidis, M. Ammar, and E. Zegura, “On the Effect of Large-Scale Deployment of Parallel Downloading,” Proc. Third IEEE Workshop Internet Applications, pp. 79-90, 2003.
[22] D. Menasce and V. Almeida, Capacity Planning for Web Services: Metrics, Models and Methods. Upper Saddle River, N.J.: Prentice Hall, 2001.
[23] H. Balakrishnan, M. Stemm, S. Seshan, and R. Katz, “Analyzing Stability in Wide-Area Network Performance,” Proc. ACM SIGMETRICS Conf. Measurement and Modeling of Computer Systems, pp. 2-12, 1997.
[24] S. Czerwinski, B. Zhao, T. Hodes, A. Joseph, and R. Katz, “An Architecture for a Secure Service Discovery Service,” Proc. ACM/IEEE Int'l Conf. Mobile Computing and Networking, pp. 24-35, 1999.
[25] D. Xu, K. Nahrstedt, and D. Wichadakul, “QoS-Aware Discovery of Wide-Area Distributed Services,” Proc. First IEEE/ACM Int'l Symp. Cluster Computing and the Grid, p. 92, 2001.
[26] J. Plank, S. Atchley, Y. Ding, and Y. Beck, “Algorithms for High Performance, Wide-Area Distributed File Downloads,” Parallel Processing Letters, vol. 13, no. 2, pp. 207-224, June 2003.
[27] R. Collins and S. Plank, “Downloading Replicated, Wide-Area Files— A Framework and Empirical Evaluation,” Proc. Third IEEE Int'l Symp. Network Computing and Applications, pp. 89-96, 2004.
[28] M. Allen and R. Wolski, “The Livny and Plank-Beck Problems: Studies in Data Movement on the Computational Grids,” Proc. ACM/IEEE SC2003 Conf. High Performance Networking and Computing, p. 43, 2003.
[29] M. Naf, “Dynamic Server Selection,” diploma thesis, Inst. for Computer Systems ETH Zürich, 1999.
[30] A. Gulbrandsen, P. Vixie, and L. Esibov, “A DNS RR for Specifying the Location of Services (DNS SRV),” IETF RFC 2782, 2000.
[31] S. Sohail, S. Jha, S. Kanhere, and C.T. Chou, “Reducing File Download Delay by QoS Driven Parallelization of Resources,” Technical Report 0518, School of Computer Science and Eng., Univ. of New South Wales, Sept. 2005.
[32] S. Sohail and S. Jha, “Parallelized FTP:- Effective Approach for Solving Huge Download Delay Problem over Internet,” Technical Report 0411, School of Computer Science and Eng., Univ. of New South Wales, Apr. 2004.
[33] R. Ja, The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling. Wiley-Interscience, 1991.
[34] Bandwidth Measurements by Ping, chapterch04.html, Apr. 2005.
[35] Jigsaw Download, http://atterer.netjigdo/, June 2004.
[36] Kazaa,, Dec. 2003.
[37] Further Network,, Dec. 2003.
[38] Download Accelerator Plus, www.speedbit.comDAPDL.asp, Dec. 2003.
[39] Pathchar, path char/, June 2004.
[40] Planet Lab, http:/, June 2004.
[41] Network Simulator-2,, June 2004.
[42] P-FTP Implementation Source Code, au/~sohailspftp-23jan.tar.gz , June 2004.
[43] P-FTP Simulation Source Code, , June 2004.
[44] Bittorent, http://bitconjurer.orgBitTorrent/, Dec. 2003.
[45] Slurpie, http:/, June 2004.
[46] BRITE, http://www.cs.bu.edubrite/, Jan. 2004.
[47] Akaroa, ns.html, Sept. 2003.
[48] Globus, , June 2004.
[49] Pchar, pchar/, Jan. 2004.
[50] Ping,, June 2004.
[51] Pure FTP Daemon, http:/, July 2004.
[52] Matlab, http:/, May 2005.

Index Terms:
Distributed applications, performance measurements, Internet.
Shaleeza Sohail, Sanjay K. Jha, Salil S. Kanhere, Chun Tung Chou, "QoS Driven Parallelization of Resources to Reduce File Download Delay," IEEE Transactions on Parallel and Distributed Systems, vol. 17, no. 10, pp. 1204-1215, Oct. 2006, doi:10.1109/TPDS.2006.144
Usage of this product signifies your acceptance of the Terms of Use.