This Article 
 Bibliographic References 
 Add to: 
The Minimal Cost Distribution Tree Problem for Recursive Expiration-Based Consistency Management
March 2004 (vol. 15 no. 3)
pp. 214-227

Abstract—The expiration-based scheme is widely used to manage the consistency of cached and replicated contents such as Web objects. In this approach, each replica is associated with an expiration time beyond which the replica has to be validated. While the expiration-based scheme has been investigated in the context of a single replica, not much work has been done on its behaviors with respect to multiple replicas. To allow for efficient consistency management, it is desirable to organize the replicas into a distribution tree where a lower level replica seeks validation with a higher level replica when its lifetime expires. This paper investigates the construction of a distribution tree for a given set of replicas with the objective of minimizing the total communication cost of consistency management. This is formulated as an optimization problem and is proven to be NP-complete. The optimal distribution tree is identified in some special cases and several heuristic algorithms are proposed for the general problem. The performance of the heuristic algorithms is experimentally evaluated against two classical graph-theoretic algorithms of tree construction: the shortest-paths tree and the minimum spanning tree.

[1] GT Internetwork Topology Models (GT-ITM),, 2003.
[2] M. . Arlitt and L. Williamson Carey, “Internet Web Servers: Workload Characterization and Performance Implications,” IEEE/ACM Trans. Networking, vol. 5, no. 5, pp. 631-645, Oct. 1997.
[3] K.L. Calvert, M.B. Doar, and E.W. Zegura, “Modeling Internet Topology,” IEEE Comm. Magazine, vol. 35, no. 6, pp. 160-163, June 1997.
[4] V. Cardellini, E. Casalicchio, M. Colajanni, and P.S. Yu, The State of the Art in Locally Distributed Web-Server Systems ACM Computing Surveys, vol. 34, no. 2, pp. 263-311, June 2002.
[5] V. Cate, Alex A Global File System Proc. USENIX File System Workshop, pp. 1-12, May 1992.
[6] E. Cohen and H. Kaplan, Aging through Cascaded Caches: Performance Issues in the Distribution of Web Content Proc. ACM SIGCOMM, pp. 41-53, Aug. 2001.
[7] E. Cohen and H. Kaplan, Refreshment Policies for Web Content Caches Proc. IEEE INFOCOM, pp. 1398-1406, Apr. 2001.
[8] T.H. Cormen, C.E. Leiserson, R.L. Rivest, and C. Stein, Introduction to Algorithms, second ed. MIT Press, 2001.
[9] M.E. Crovella and A. Bestavros, “Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes,” IEEE/ACM Trans. Networking, vol. 5, no. 6, pp. 835-846, Dec. 1997.
[10] J. Dilley, “The Effect of Consistency on Cache Response Time,” IEEE Network, vol. 14, no. 3, pp. 24-28, 2000.
[11] V. Duvvuri, P. Shenoy, and R. Tewari, Adaptive Leases: A Strong Consistency Mechanism for the World Wide Web IEEE Trans. Knowledge and Data Eng., vol. 15, no. 5, pp. 1266-1276, Sept./Oct. 2003.
[12] M. Faloutsos, P. Faloutsos, and C. Faloutsos, On Power-Law Relationships of the Internet Topology Proc. ACM SIGCOMM, pp. 251-262, Sept. 1999.
[13] M. Garey and D. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H. Freeman, 1979.
[14] J. Gwertzman and M. Seltzer, World Wide Web Cache Consistency Proc. USENIX Ann. Technical Conf., pp. 141-151, Jan. 1996.
[15] Y.T. Hou, J. Pan, B. Li, X. Tang, and S. Panwar, Modeling and Analysis of an Expiration-Based Hierarchical Caching System Proc. IEEE GLOBECOM, pp. 2468-2472, Nov. 2002.
[16] R. Jain, The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling. John Wiley&Sons, 1991.
[17] S. Jamin, C. Jin, Y. Jin, D. Riaz, Y. Shavitt, and L. Zhang, On the Placement of Internet Instrumentation Proc. IEEE INFOCOM 2000, Mar. 2000.
[18] J. Jung, E. Sit, H. Balakrishnan, and R. Morris, DNS Performance and the Effectiveness of Caching IEEE/ACM Trans. Networking, vol. 10, no. 5, pp. 589-603, Oct. 2002.
[19] B. Krishnamurthy and C.E. Wills, Study of Piggyback Cache Validation for Proxy Caches in the World Wide Web Proc. First USENIX Symp. Internet Technologies and Systems (USITS), pp. 1-12, Dec. 1997.
[20] D. Li and D.R. Cheriton, Scalable Web Caching of Frequently Updated Objects Using Reliable Multicast Proc. Second USENIX Symp. Internet Technologies and Systems (USITS), pp. 1-12, Oct. 1999.
[21] A. Ninan, P. Kulkarni, P. Shenoy, K. Ramamritham, and R. Tewari, Scalable Consistency Maintenance in Content Distribution Networks Using Cooperative Leases IEEE Trans. Knowledge and Data Eng., vol. 15, no. 4, pp. 813-828, July/Aug. 2003.
[22] M. Rabinovich and O. Spatscheck, Web Caching and Replication. Addison-Wesley, 2002.
[23] P. Rodriguez and S. Sibal, SPREAD: Scalable Platform for Reliable and Efficient Automated Distribution Computer Networks, vol. 33, nos. 1-6, pp. 33-49, June 2000.
[24] A. Shaikh, R. Tewari, and M. Agrawal, On the Effectveness of DNS-Based Server Selection Proc. IEEE Infocom, Apr. 2001.
[25] X. Tang and S.T. Chanson, The Minimal Cost Distribution Tree Problem for Recursive Expiration-Based Consistency Management Technical Report HKUST-CS03-14, Dept. of Computer Science, Hong Kong Univ. of Science and Tech nology, Aug. 2003.
[26] B. Urgaonkar, A. Ninan, M. Raunak, P. Shenoy, and K. Ramamritham, Maintaining Mutual Consistency for Cached Web Objects Proc. 21st Int'l Conf. Distributed Computing Systems (ICDCS-21), pp. 371-380, Apr. 2001.
[27] B.M. Waxman, “Routing of Multipoint Connections,” IEEE J. Selected Areas in Comm., vol. 6, no. 9, pp. 1617-1622, Dec. 1988.
[28] J. Yin, L. Alvisi, M. Dahlin, and A. Iyengar, Engineering Web Cache Consistency ACM Trans. Internet Technology, vol. 2, no. 3, pp. 224-259, Aug. 2002.
[29] J. Yin, L. Alvisi, M. Dahlin, and C. Lin, Hierarchical Cache Consistency in a WAN Proc. Second USENIX Symp. Internet Technologies and Systems (USITS), pp. 13-24, Oct. 1999.
[30] J. Yin et al., "Volume Leases for Consistency in Large-Scale Systems," IEEE Trans. Knowledge and Data Eng., vol. 11, no. 4, July/Aug. 1999.
[31] H. Yu, L. Breslau, and S. Shenker, A Scalable Web Cache Consistency Architecture Proc. ACM SIGCOMM, pp. 163-174, Sept. 1999.
[32] G.K. Zipf, Human Behavior and the Principles of Least Effort. Addison-Wesley, 1949.

Index Terms:
Expiration-based consistency management, caching, replication, performance analysis, optimization, graph theory, NP-complete.
Xueyan Tang, Samuel T. Chanson, "The Minimal Cost Distribution Tree Problem for Recursive Expiration-Based Consistency Management," IEEE Transactions on Parallel and Distributed Systems, vol. 15, no. 3, pp. 214-227, March 2004, doi:10.1109/TPDS.2004.1264807
Usage of this product signifies your acceptance of the Terms of Use.