This Article 
 Bibliographic References 
 Add to: 
Integrating Web Caching and Web Prefetching in Client-Side Proxies
May 2005 (vol. 16 no. 5)
pp. 444-455

Abstract—Web caching and Web prefetching are two important techniques used to reduce the noticeable response time perceived by users. Note that by integrating Web caching and Web prefetching, these two techniques can complement each other since the Web caching technique exploits the temporal locality, whereas Web prefetching technique utilizes the spatial locality of Web objects. However, without circumspect design, the integration of these two techniques might cause significant performance degradation to each other. In view of this, we propose in this paper an innovative cache replacement algorithm, which not only considers the caching effect in the Web environment, but also evaluates the prefetching rules provided by various prefetching schemes. Specifically, we formulate a normalized profit function to evaluate the profit from caching an object (i.e., either a nonimplied object or an implied object according to some prefetching rule). Based on the normalized profit function devised, we devise an innovative Web cache replacement algorithm, referred to as Algorithm IWCP (standing for the Integration of Web Caching and Prefetching). Using an event-driven simulation, we evaluate the performance of Algorithm IWCP under several circumstances. The experimental results show that Algorithm IWCP consistently outperforms the companion schemes in various performance metrics.

[1] Akamai Technologies Inc., http:/, 2004.
[2] Mirror Image Internet Inc., http:/, 2004.
[3] Sandpiper Networks/Digital Island, Inc., http:/www.sandpiper. net/, 2004.
[4] C. Aggarwal, J.L. Wolf, and P.-S. Yu, “Caching on the World Wide Web,” IEEE Trans. Knowledge and Data Eng., vol. 11, no. 1, pp. 94-107, Jan./Feb. 1999.
[5] P. Barford and M. Crovella, “Generating Representative Web Workloads for Network and Server Performance Evaluation,” Proc. 1998 ACM SIGMETRICS Int'l Conf. Measurements and Modeling of Computer Systems, 1998.
[6] G. Barish and K. Obraczka, “World Wide Web Caching: Trends and Techniques,” IEEE Comm. Magazine, Internet Technology Series, pp. 178-185, 2000.
[7] L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker, “Web Caching and Zipf-Like Distributions: Evidence and Implications,” Proc. IEEE INFOCOM 1999, Mar. 1999.
[8] P. Cao, E.W. Felten, A. Karlin, and K. Li, “A Study of Integrated Prefetching and Caching Strategies,” Proc. 1995 ACM SIGMETRICS Int'l Conf. Measurements and Modeling of Computer Systems, pp. 188-197, 1995.
[9] P. Cao and S. Irani, “Cost-Aware WWW Proxy Caching Algorithms,” Proc. 1997 USENIX Symp. Internet Technology and Systems, 1997.
[10] M.-S. Chen, J.-S. Park, and P.S. Yu, “Efficient Data Mining for Path Traversal Patterns,” IEEE Trans. Knowledge and Data Eng., vol. 10, no. 2, pp. 209-221, Mar./Apr. 1998.
[11] K. Chinen and S. Yamaguchi, “An Interactive Prefetching Proxy Server for Improvement of WWW Latency,” Proc. Seventh Ann. Conf. Internet Soc., June 1997.
[12] E. Cohen, B. Krishnamurthy, and J. Rexford, “Improving End-to-End Performance of the Web Using Server Volumes and Proxy Filters,” Proc. ACM SIGCOMM 1998, pp. 241-253, 1998.
[13] M. Crovella and P. Barford, “The Network Effects of Prefetching,” Proc. IEEE INFOCOM 1998, pp. 1232-1240, 1998.
[14] C. Cunha, A. Bestavros, and M. Crovella, “Characteristics of WWW Client-Based Traces,” technical report, Boston Univ., Apr. 1995.
[15] B.D. Davison, “Predicting Web Actions from HTML Content,” Proc. 13th ACM Conf. Hypertext and Hypermedia, pp. 159-168, June 2002.
[16] M. Deshpande and G. Karypis, “Selective Markov Models for Predicting Web-Page Accesses,” Proc. First SIAM Int'l Conf. Data Mining, 2001.
[17] D. Duchamp, “Prefetching Hyperlinks,” Proc. Second USENIX Symp. Internet Technologies and Systems, pp. 127-138, Oct. 1999.
[18] L. Fan, P. Cao, J. Almeida, and A.Z. Broder, “Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol,” IEEE/ACM Trans. Networking, vol. 8, no. 3, pp. 281-293, 2000.
[19] L. Fan, P. Cao, W. Lin, and Q. Jacobson, “Web Prefetching between Low-Bandwidth Clients and Proxies: Potential and Performance,” Proc. 1999 ACM SIGMETRICS Int'l Conf. Measurements and Modeling of Computer Systems, pp. 178-187, 1999.
[20] S. Glassman, “A Caching Relay for the World Wide Web,” Computer Networks and ISDN Systems, vol. 27, 1994.
[21] A. Kraiss and G. Weikum, “Integrated Document Caching and Prefetching in Storage Hierarchies Based on Markov-Chain Predictions,” Very Large Databases J., vol. 7, no. 3, pp. 141-162, 1998.
[22] T. Kroeger, D.E. Long, and J. Mogul, “Exploiting the Bounds of Web Latency Reduction from Caching and Prefetching,” Proc. USENIX Symp. Internet Technologies and Systems, pp. 13-22, 1997.
[23] B. Lan, S. Bressan, B.C. Ooi, and K. Tan, “Rule-Assisted Prefetching in Web Server Caching,” Proc. 2000 ACM Int'l Conf. Information and Knowledge Management, 2000.
[24] R. Lempel and S. Moran, “Predictive Caching and Prefetching of Query Results in Search Engines,” Proc. 12th Int'l Conf. World Wide Web, pp. 19-28, May 2003.
[25] R. Lempel and S. Moran, “Optimizing Result Prefetching in Web Search Engines with Segmented Indices,” ACM Trans. Internet Technology, vol. 4, no. 1, pp. 31-59, Feb. 2004.
[26] A. Nanopoulos, D. Katsaros, and Y. Manolopoulos, “Effective Prediction of Web-User Accesses: A Data Mining Approach,” Proc. Workshop Web Usage Analysis and User Profiling (WebKDD), 2001.
[27] V. Padmanabhan and J.C. Mogul, “Using Predictive Prefetching to Improve World Wide Web Latency,” ACM SIGCOMM Computer Comm. Rev., vol. 26, no. 3, 1996.
[28] A. Papoulis, Probability, Random Variables and Stochastic Processes. McGraw Hill, 1991.
[29] J. Pitkow, “Summary of WWW Characteristics,” World Wide Web, vol. 2, nos. 1-2, pp. 3-13, 1999.
[30] J. Pitkow and P. Pirolli, “Mining Longest Repeating Subsequence to Predict World Wide Web Surfing,” Proc. Second USENIX Symp. Internet Technologies and Systems, 1999.
[31] K. Ross, “Hash-Routing for Collections of Shared Web Caches,” IEEE Network Magazine, pp. 37-44, Nov.-Dec. 1997.
[32] R.R. Sarukkai, “Link Prediction and Path Analysis Using Markov Chains,” Proc. Ninth Int'l World Wide Web Conf., 2000.
[33] J. Shim, P. Scheuermann, and R. Vingralek, “Proxy Cache Algorithms: Design, Implementation, and Performance,” IEEE Trans. Knowledge and Data Eng., vol. 11, no. 4, pp. 549-561, July/Aug. 1999.
[34] S. Williams, M. Abrams, C.R. Standridge, G. Abdulla, and E. Fox, “Removal Policies in Network Caches for World Wide Web Documents,” Proc. ACM SIGCOMM 1996, pp. 293-304, 1996.
[35] R.P. Wooster and M. Abrams, “Proxy Caching That Estimates Page Load Delays,” Proc. Sixth Int'l World Wide Web Conf., 1997.
[36] Y.-H. Wu and A.L. Chen, “Prediction of Web Page Accesses by Proxy Server Log,” World Wide Web, vol. 5, no. 1, pp. 67-88, 2002.
[37] J. Xu, Q. Hu, D.-L. Lee, and W.-C. Lee, “SAIU: An Efficient Cache Replacement Policy for Wireless On-Demand Broadcasts,” Proc. 2000 ACM CIKM Int'l Conf. Information and Knowledge Management, pp. 46-53, 2000.
[38] Q. Yang and H.H. Zhang, “Integrating Web Prefetching and Caching Using Prediction Models,” World Wide Web, vol. 4, no. 4, pp. 299-321, 2001.
[39] Q. Yang, H.H. Zhang, and I.T. Li, “Mining Web Logs for Prediction Models in WWW Caching and Prefetching,” Proc. Seventh ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, pp. 473-478, Aug. 2001.

Index Terms:
Web Proxy, caching, prefetching.
Wei-Guang Teng, Cheng-Yue Chang, Ming-Syan Chen, "Integrating Web Caching and Web Prefetching in Client-Side Proxies," IEEE Transactions on Parallel and Distributed Systems, vol. 16, no. 5, pp. 444-455, May 2005, doi:10.1109/TPDS.2005.56
Usage of this product signifies your acceptance of the Terms of Use.