This Article 
 Bibliographic References 
 Add to: 
Optimizing Cyclic Join View Maintenance over Distributed Data Sources
March 2006 (vol. 18 no. 3)
pp. 363-376
Elke A. Rundensteiner, IEEE Computer Society
Materialized views defined over distributed data sources are critical for many applications to ensure efficient access, reliable performance, and high availability. Materialized views need to be maintained upon source updates since stale view extents may not serve well or may even mislead user applications. Thus, view maintenance performance is one of the keys to the success of these applications. In this work, we investigate two maintenance strategies, extended batching and view graph transformation, for maintaining general join views where join conditions may exist between any pairs of data sources possibly with cycles. Many choices are available for maintaining cyclic join views. We thus propose a cost-driven view maintenance framework which generates optimized maintenance plans tuned to the environmental settings. The proposed framework has been implemented in the TxnWrap system. Experimental studies illustrate that our proposed optimization techniques significantly improve the view maintenance performance in a distributed environment.

[1] D. Agrawal, A.E. Abbadi, A. Singh, and T. Yurek, “Efficient View Maintenance at Data Warehouses,” Proc. ACM SIGMOD, pp. 417-427, 1997.
[2] J.A. Blakeley, P.-A. Larson, and F.W. Tompa, “Efficiently Updating Materialized Views,” Proc. ACM SIGMOD, pp. 61-71, May 1986.
[3] J. Chen, X. Zhang, S. Chen, K. Andreas, and E.A. Rundensteiner, “DyDa: Data Warehouse Maintenance under Fully Concurrent Environments,” Proc. ACM SIGMOD Demo Session, p. 619, 2001.
[4] S. Chen, B. Liu, and E.A. Rundensteiner, “Multiversion Based View Maintenance over Distributed Data Sources,” ACM Trans. Database Systems (TODS), vol. 29, no. 4, pp. 675-709, 2004.
[5] L.S. Colby, T. Griffin, L. Libkin, I.S. Mumick, and H. Trickey, “Algorithms for Deferred View Maintenance,” Proc. SIGMOD, pp. 469-480, 1996.
[6] M.R. Garey and D.S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness. WH Freeman and Co., 1979.
[7] A. Gupta and I. Mumick, “Maintenance of Materialized Views: Problems, Techniques, and Applications,” IEEE Data Eng. Bull., vol. 18, no. 2, pp. 3-19, 1995.
[8] L.M. Haas, D. Kossmann, E.L. Wimmers, and J. Yang, “Optimizing Queries Across Diverse Data Sources,” Proc. Conf. Very Large Databases, pp. 276-285, 1997.
[9] H. He, J. Xie, J. Yang, and H. Yu, “Asymmetric Batch Incremental View Maintenance,” Proc. Int'l Conf. Data Eng., pp. 106-117, 2005.
[10] Z.G. Ives, D. Florescu, M. Friedman, A.Y. Levy, and D.S. Weld, “An Adaptive Query Execution System for Data Integration,” Proc. ACM SIGMOD, pp. 299-310, 1999.
[11] D. Kossmann, “The State of the Art in Distributed Query Processing,” ACM Computing Surveys (CSUR), vol. 32, no. 4, pp. 422-469, 2000.
[12] W.J. Labio, R. Yerneni, and H. García-Molina, “Shrinking the Warehouse Updated Window,” Proc. ACM SIGMOD, pp. 383-395, June 1999.
[13] K.Y. Lee, J.H. Son, and M.H. Kim, “Efficient Incremental View Maintenance in Data Warehouses,” Proc. Int'l Conf. Information and Knowledge Management (CIKM '01), pp. 349-356, Nov. 2001.
[14] B. Liu, S. Chen, and E.A. Rundensteiner, “Batch Data Warehouse Maintenance in Dynamic Environments,” Proc. Int'l Conf. Information and Knowledge Management (CIKM '02), pp. 68-75, Nov. 2002.
[15] B. Liu and E.A. Rundensteiner, “Cost-Driven General Join View Maintenance over Distributed Data Soruces,” Proc. Int'l Conf. Data Eng., pp. 578-579, Apr. 2005.
[16] B. Liu, E.A. Rundensteiner, and D. Finkel, “Restructuring View Maintenance Plans for Large Update Batches,” Technical Report WPI-CS-TR-03-29, Worcester Polytechnic Inst., 2003.
[17] L.F. Mackert and G.M. Lohman, “R* Optimizer Validation and Performance Evaluation for Local Queries,” Proc. ACM SIGMOD, pp. 84-95, 1986.
[18] J. Neter, M. Kunter, C. Nachtsheim, and W. Wasserman, Applied Linear Statistical Models. Times Mirror, 1996.
[19] K. O'Gorman, D. Agrawal, and A.E. Abbadi, “Posse: A Framework for Optimizing Incremental View Maintenance at Data Warehouse,” Data Warehousing and Knowledge Discovery, pp. 106-115, 1999.
[20] D. Quass, “Maintenance Expressions for Views with Aggregation,” Proc. Workshop Materialized Views: Techniques and Applications, June 1996.
[21] K. Salem, K.S. Beyer, R. Cochrane, and B.G. Lindsay, “How to Roll a Join: Asynchronous Incremental View Maintenance,” Proc. ACM SIGMOD, pp. 129-140, 2000.
[22] P.G. Selinger, M.M. Astrahan, D.D. Chamberlin, R.A. Lorie, and T.G. Price, “Access Path Selection in a Relational Database Management System,” Proc. ACM SIGMOD, pp. 23-34, 1979.
[23] T.K. Sellis, “Multiple-Query Optimization,” ACM Trans. Database Systems (TODS), vol. 13, no. 1, pp. 23-52, 1988.
[24] M. Stonebraker, P. Aoki, A. Pfeffer, A. Sah, J. Sidell, C. Staelin, and A. Yu, “Mariposa: A Wide-Area Distributed Database System,” VLDB J., vol. 5, no. 1, pp. 48-63, 1996.
[25] TPC, TPC-H Benchmark Standard Specification, http://www. tpc.orgtpch/, 2005.
[26] T. Urhan, M.J. Franklin, and L. Amsaleg, “Cost-Based Query Scrambling for Initial Delays,” Proc. ACM SIGMOD, pp. 130-141, 1998.
[27] C. Wang and M.-S. Chen, “On the Complexity of Distributed Query Optimization,” IEEE Trans. Knowledge and Data Eng., vol. 8, no. 4, pp. 650-662, Aug. 1996.
[28] Q. Zhu, Y. Sun, and S. Motheramgari, “Developing Cost Models with Qualitative Variables for Dynamic Multidatabase Environments,” Proc. Int'l Conf. Data Eng., pp. 413-424, 2000.
[29] Y. Zhuge, H. García-Molina, J. Hammer, and J. Widom, “View Maintenance in a Warehousing Environment,” Proc. ACM SIGMOD Conf., pp. 316-327, May 1995.
[30] Y. Zhuge, H. García-Molina, and J.L. Wiener, “The Strobe Algorithms for Multi-Source Warehouse Consistency,” Parallel and Distributed Information Systems, pp. 146-157, 1996.

Index Terms:
Materialized view maintenance, view maintenance optimization, distributed query optimization, grouping maintenance, view graph.
Bin Liu, Elke A. Rundensteiner, "Optimizing Cyclic Join View Maintenance over Distributed Data Sources," IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 3, pp. 363-376, March 2006, doi:10.1109/TKDE.2006.50
Usage of this product signifies your acceptance of the Terms of Use.