This Article 
 Bibliographic References 
 Add to: 
A Compensation-Based Approach for View Maintenance in Distributed Environments
August 2006 (vol. 18 no. 8)
pp. 1068-1081
Data integration over multiple heterogeneous data sources has become increasingly important for modern applications. The integrated data is usually stored as materialized views to allow better access, performance, and high availability. In loosely coupled environments, such as the Data Grid, the data sources are autonomous. Hence, the source updates can be concurrent and cause erroneous results during view maintenance. State-of-the-art maintenance strategies apply compensating queries to correct such errors, making the restricting assumption that all source schemata remain static over time. However, in such dynamic environments, the data sources may change not only their data but also their schema. Consequently, either the maintenance queries or the compensating queries may fail. In this paper, we propose a novel framework called DyDa that overcomes these limitations and handles both source data updates and schema changes. We identify three types of maintenance anomalies, caused by either source data updates, data-preserving schema changes, or non-data-preserving schema changes. We propose a compensation algorithm to solve the first two types of anomalies. We show that the third type of anomaly is caused by the violation of dependencies between maintenance processes. Then, we propose dependency detection and correction algorithms to identify and resolve the violations. Put together, DyDa extends prior maintenance solutions to solve all types of view maintenance anomalies. The experimental results show that DyDa imposes a minimal overhead on data update processing while allowing for the extended functionality to handle concurrent schema changes.

[1] D. Agrawal, A.E. Abbadi, A. Singh, and T. Yurek, “Efficient View Maintenance at Data Warehouses,” Proc. ACM SIGMOD Conf., pp. 417-427, 1997.
[2] P.A. Bernstein, V. Hadzilacos, and N. Goodman, Concurrency Control and Recovery in Database Systems. Addison-Wesley, 1987.
[3] J. Chen, X. Zhang, S. Chen, K. Andreas, and E.A. Rundensteiner, “DyDa: Data Warehouse Maintenance under Fully Concurrent Environments,” Proc. ACM SIGMOD Conf., p. 619, 2001.
[4] S. Chen, “Efficient Incremental View Maintenance for Data Warehousing,” PhD thesis, Computer Science Dept., Worcester Polytechnic Inst., 2005.
[5] S. Chen, J. Chen, X. Zhang, and E.A. Rundensteiner, “Detection and Correction of Conflicting Source Updates for View Maintenance,” Proc. Int'l Conf. Data Eng., pp. 436-448, 2004.
[6] S. Chen, B. Liu, and E.A. Rundensteiner, “Multiversion Based View Maintenance over Distributed Data Sources,” ACM Trans. Database Systems (TODS), vol. 29, no. 4, pp. 675-709, 2004.
[7] L.S. Colby, T. Griffin, L. Libkin, I.S. Mumick, and H. Trickey, “Algorithms for Deferred View Maintenance,” Proc. ACM SIGMOD Conf., pp. 469-480, 1996.
[8] H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajaraman, Y. Sagiv, J.D. Ullman, V. Vassalos, and J. Widoms, “The Tsimmis Approach to Mediation: Data Models and Languages,” J. Intelligent Information Systems, vol. 8, no. 2, pp. 117-132, 1997.
[9] T. Griffin and L. Libkin, “Incremental Maintenance of Views with Duplicates,” Proc. ACM SIGMOD Conf., pp. 328-339, 1995.
[10] A. Gupta, I.S. Mumick, J. Rao, and K.A. Ross, “Adapting Materialized Views after Redefinitions: Techniques and a Performance Study,” Information Systems, vol. 26, no. 5, pp. 323-362, 2001.
[11] A. Gupta, I.S. Mumick, and V.S. Subrahmanian, “Maintaining Views Incrementally,” Proc. ACM SIGMOD Conf., pp. 157-166, 1993.
[12] L.M. Haas, D. Kossmann, E.L. Wimmers, and J. Yang, “Optimizing Queries across Diverse Data Sources,” Proc. Very Large Databases Conf., pp. 276-285, 1997.
[13] A.Y. HaLevy, “Answering Queries Using Views: A Survey,” VLDB J., vol. 10, no. 4, pp. 270-294, 2001.
[14] A. Jagatheensan and A. Rajasekar, “Data Grid Management Systems,” Proc. ACM SIGMOD Conf. , p. 683, 2003.
[15] A. Koeller and E.A. Rundensteiner, “Incremental Maintenance of Schema-Restructuring Views in SchemaSQL,” IEEE Trans. Knowledge and Data Eng., vol. 16, no. 9, pp. 1096-1111, 2004.
[16] A.M. Lee, A. Nica, and E.A. Rundensteiner, “The EVE Approach: View Synchronization in Dynamic Distributed Environments,” IEEE Trans. Knowledge and Data Eng., vol. 14, no. 5, pp. 931-945, 2002.
[17] M. Lenzerini, “Data Integration: A Theoretical Perspective,” Proc. Int'l Conf. Management of Data/Principles of Database Systems, pp. 233-246, 2002.
[18] A.Y. Levy, Z.G. Ives, P. Mork, and I. Tatarinov, “Piazza: Data Management Infrastructure for Semantic Web Applications,” Proc. 12th Int'l World Wide Web Conf., pp. 556-567, 2003.
[19] T. Ling and E. Sze, “Materialized View Maintenance Using Version Numbers,” Proc. Int'l Conf. Database Systems for Advanced Applications (DASFAA), pp. 263-270, 1999.
[20] J.J. Lu, G. Moerkotte, J. Schue, and V.S. Subrahmanian, “Efficient Maintenance of Materialized Mediated Views,” Proc. ACM SIGMOD Conf., pp. 340-351, May 1995.
[21] J. Madhaven, P.A. Bernstein, and E. Rahm, “Generic Schema Matching with Cupid,” Proc. Very Large Databases Conf., pp. 49-58, 2001.
[22] R.J. Miller, L.M. Haas, and M.A. Hernández, “Schema Mapping as Query Discovery,” Proc. Very Large Databases Conf., pp. 77-88, 2000.
[23] A. Nica, A.J. Lee, and E.A. Rundensteiner, “The CVS Algorithm for View Synchronization in Evolvable Large-Scale Information Systems,” Proc. Int'l Conf. Extending Database Technology (EDBT), pp. 359-373, 1998.
[24] A. Nica and E.A. Rundensteiner, “View Maintenance after View Synchronization,” Proc. Int'l Database Eng. and Applications Symp. (IDEAS), pp. 213-215, 1999.
[25] K. Salem, K.S. Beyer, R. Cochrane, and B.G. Lindsay, “How to Roll a Join: Asynchronous Incremental View Maintenance,” Proc. ACM SIGMOD Conf., pp. 129-140, 2000.
[26] R. Tarjan, “Depth-First Search and Linear Graph Algorithms,” SIAM J. Computing, vol. 1, no. 2, June 1972.
[27] Y. Velegrakis, R.J. Miller, and L. Popa, “Mapping Adaptation under Evolving Schemas,” Proc. Very Large Databases Conf., pp. 584-595, 2003.
[28] C. Yu and L. Popa, “Semantic Adaptation of Schema Mapping after Schemas Evolve,” Proc. Very Large Databases Conf., pp. 1006-1017, 2005.
[29] X. Zhang and E.A. Rundensteiner, “Integrating Maintenance and Synchronization of Data Warehouses Using a Cooperative Framework,” Information Systems, vol. 27, no. 4, pp. 219-243, 2002.
[30] Y. Zhuge, H. García-Molina, J. Hammer, and J. Widom, “View Maintenance in a Warehousing Environment,” Proc. ACM SIGMOD Conf., pp. 316-327, May 1995.
[31] Y. Zhuge, H. García-Molina, and J.L. Wiener, “The Strobe Algorithms for Multi-Source Warehouse Consistency,” Proc. Conf. Parallel and Distributed Information Systems, pp. 146-157, Dec. 1996.

Index Terms:
View maintenance, view synchronization, view adaptation, concurrency control, view maintenance anomaly.
Songting Chen, Xin Zhang, Elke A. Rundensteiner, "A Compensation-Based Approach for View Maintenance in Distributed Environments," IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 8, pp. 1068-1081, Aug. 2006, doi:10.1109/TKDE.2006.117
Usage of this product signifies your acceptance of the Terms of Use.