This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
The EVE Approach: View Synchronization in Dynamic Distributed Environments
September/October 2002 (vol. 14 no. 5)
pp. 931-954

Abstract—The construction and maintenance of data warehouses (views) in large-scale environments composed of numerous distributed and evolving information sources (ISs) such as the WWW has received great attention recently. Such environments are plagued with changing information because ISs tend to continuously evolve by modifying not only their content but also their query capabilities and interface and by joining or leaving the environment at any time. We are the first to introduce and address the problem of schema changes of ISs, while previous work in this area, such as incremental view maintenance, has mainly dealt with data changes at ISs. In this paper, we outline our solution approach to this challenging new problem of how to adapt views in such evolving environments. We identify a new view adaptation problem for view evolution in the context of ISs schema changes, which we call View Synchronization. We also outline the Evolvable View Environment (EVE) approach that we propose as framework for solving the view synchronization problem, along with our decisions concerning the key design issues surrounding EVE. The main contributions of this paper are: 1) we provide an E-SQL view definition language with which the view definer can direct the view evolution process, 2) we introduce a model for information source description which allows a large class of ISs to participate in our system dynamically, 3) we formally define what constitutes a legal view rewriting, 4) we develop replacement strategies for affected view components which are designed to meet the preferences expressed by E-SQL, 5) we prove the correctness of the replacement strategies, and 6) we provide a set of view synchronization algorithms based on those strategies. A prototype of our EVE system has successfully been built using Java, JDBC, Oracle, and MS Access.

[1] Y. Arens, C.A. Knoblock, and W.-M. Shen, “Query Reformulation for Dynamic Information Integration,” J. Intelligent Information Systems, special issue on intelligent information integration, vol. 6, nos. 2/3, pp. 99–130, 1996.
[2] J.A. Blakeley, N. Coburn, and P.-A. Larson, “Updating Derived Relations: Detecting Irrelevant and Autonomously Computable Updates,” ACM Trans. Database Systems, vol. 14, no. 3, pp. 369-400, Sept. 1989.
[3] M.J. Carey, L.M. Haas, P.M. Schwarz, M. Arya, W.F. Cody, R. Fagin, M. Flickner, A.W. Luniewski, W. Niblack, D. Petkovic, J.H. Williams, J. Thomas, and E.L. Wimmers, “Towards Heterogeneous Multimedia Information Systems: The Garlic Approach,” Proc. Fifth Int'l Workshop Research Issues in Data Eng. (RIDE): Distributed Object Management, 1995.
[4] W.W. Chu, M.A. Merzbacher, and L. Berkovich, “The Design and Implementation of CoBase,” SIGMOD Record, vol. 22, no. 2, pp. 517-522, June 1993.
[5] S. Cohen, W. Nutt, and A. Serebrenik, “Rewriting Aggregate Queries Using Views,” Proc. ACM Symp. Principles of Database Systems, C. Papadimitriou, ed., May 1999.
[6] W. Chu, H. Yang, K. Chiang, M. Minock, G. Chow, and C. Larson, “Cobase: A Scalable and Extensible Cooperative Information System,” J. Intelligent Information Systems, vol. 6, pp. 223-259, May 1996.
[7] O. Etzioni and D. Weld, “A Softbot‐Based Interface to the Internet,” Comm. ACM, Vol. 37, No. 7, July 1994, pp. 72‐76.
[8] D. Florescu, L. Raschid, and P. Valduriez, “Using Heterogenous Equivalence for Query Rewriting in Multidatabase Systems,” Proc. Third Int'l Conf. Cooperative Information Systems, 1995.
[9] A. Gupta, H.V. Jagadish, and I.S. Mumick, “Data Integration Using Self-Maintainable Views,” Proc. Int'l Conf. Extending Database Technology (EDBT), pp. 140-144, 1996.
[10] S. Galindo-Legaria, “Outerjoins as Disjunctions,” Proc. SIGMOD, pp. 348-358, 1994.
[11] M. Jarke, M.A. Jeusfeld, C. Quix, and P. Vassil-iadis, “Architecture and Quality in Data Warehouses: An Extended Repository Approach,” Information Systems, vol. 24, no. 3, pp. 229-253, 1999.
[12] A.J. Lee, A. Koeller, A. Nica, and E.A. Rundensteiner, “Data Warehouse Evolution: Trade-Offs between Quality and Cost of Query Rewritings,” Technical Report WPI-CS-TR-98-2, revised in 1999., Worcester Polytechnic Inst., Dept. of Computer Science, 1998.
[13] A.J. Lee, A. Koeller, A. Nica, and E.A. Rundensteiner, “Data Warehouse Evolution: Trade-Offs between Quality and Cost of Query Rewritings,” Proc. IEEE Int'l Conf. Data Eng., Poster Session p. 255, Mar. 1999.
[14] A.J. Lee, A. Koeller, A. Nica, and E.A. Rundensteiner, “Non-Equivalent Query Rewritings,” Proc. Int'l Database Conf., pp. 248-262, July 1999.
[15] A.Y. Levy, A.O. Mendelzon, and Y. Sagiv, “Answering Queries Using Views,” Proc. ACM Symp. Principles of Database Systems, pp. 95-104, May 1995.
[16] A. Levy, I.S. Mumick, Y. Sagiv, and O. Shmueli, “Equivalence, Query Reachability and Satisfiability in Datalog Extensions,” Proc. 12th ACM SIGACT-SIGMOD-SIGART Symp. Principles of Database Systems, pp. 109-122, May 1993.
[17] A.J. Lee, A. Nica, and E.A. Rundensteiner, “Keeping Virtual Information Resources Up and Running,” Proc. IBM Centre for Advanced Studies Conf. (CASCON '97), pp. 1-14, Nov. 1997. (Best paper award.)
[18] A.J. Lee, A. Nica, and E.A. Rundensteiner, “The EVE Framework: View Synchronization in Evolving Environments,” Technical Report WPI-CS-TR-97-4, Worcester Polytechnic Inst. Dept. of Computer Science, 1997.
[19] A.Y. Levy, A. Rajaraman, and J.D. Ullman, “Answering Queries Using Limited External Processors,” Proc. 15th ACM Symp. Principals of Database Systems (pods), pp. 227-237, June 1996.
[20] A. Levy and Y. Sagiv, “Constraints and Redundancy in Datalog,” Proc. 11th ACM SIGACT-SIGMOD-SIGART Symp. Principles of Database Systems, pp. 67-80, June 1992.
[21] A.Y. Levy, D. Srivastava, and T. Kirk, “Data Model and Query Evaluation in Global Information Systems,” J. Intelligent Information Systems, special issue on networked information discovery and retrieval, vol. 5, no. 2, 1995.
[22] L.V.S. Lakshmanan, F. Sadri, and I.N. Subramanian, “SchemaSQL—A Language for Interoperability in Relational Multi-Database Systems,” Proc. 22nd Int'l Conf. Very Large Data Bases, T.M. Vijayaraman et al., eds., pp. 239-250, Sept. 1996.
[23] C. Li et al., "Capability-based Mediation in TSIMMIS," ACM SIGMOD Record, Vol. 27, No. 2, 1998, pp. 564-566.
[24] M. Mohania and G. Dong, “Algorithms for Adapting Materialized Views in Data Warehouses,” Proc. Int'l Symp. Cooperative Database Systems for Advanced Applications, Dec. 1996.
[25] Y. Vassiliou and M. Jarke, “Data Warehouse Quality: A Review of the DWQ Project,” Proc. Second Conf. Information Quality, 1997.
[26] A. Nica, A.J. Lee, and E.A. Rundensteiner, “The Complex Substitution Algorithm for View Synchronization,” Technical Report WPI-CS-TR-97-8, Worcester Polytechnic Inst. Dept. of Computer Science 1997.
[27] A. Nica, A.J. Lee, and E.A. Rundensteiner, “The CVS Algorithm for View Synchronization in Evolvable Large-Scale Information Systems,” Proc. Int'l Conf. Extending Database Technology (EDBT '98), pp. 359-373, Mar. 1998.
[28] A. Nica and E.A. Rundensteiner, “Loosely-Specified Query Processing in Large-Scale Information Systems,” Int'l J. Cooperative Information Systems, vol. 7, no. 1, pp. 77-104, 1998.
[29] A. Nica and E.A. Rundensteiner, “On Translating Loosely-Specified Queries into Executable Plans in Large-Scale Information Systems,” Proc. Second IFCIS Int'l Conf. Cooperative Information Systems CoopIS '97, pp. 213-222, June 1997.
[30] Y. Papakonstantinou, H. García-Molina, and J. Ullman, “Medmaker: A Mediation System Based on Declarative Specifications,” Proc. IEEE Int'l Conf. Data Eng., 1996.
[31] Y. Papakonstantinuo, H. Garcia-Molina, and J. Widom, “Object Exchange Across Heterogeneous Information Sources,” Proc. Int'l Conf. Data Engineering, pp. 251–260, 1995.
[32] C. Quix, “Repository Support for Data Warehouse Evolution,” Proc. Int'l Workshop Design and Management of Data Warehouses (DMDW '99), pp. 4.1-4.9, June 1999.
[33] E.A. Rundensteiner, A. Koeller, X. Zhang, A. Lee, A. Nica, A. VanWyk, and Y. Li, “Evolvable View Environment,” Proc. SIGMOD'99 Demo Session, pp. 553-555, May/June 1999.
[34] E.A. Rundensteiner, A. Koeller, and X. Zhang, “Maintaining Data Warehouses over Changing Information Sources,” Comm. ACM, vol. 43, no. 6, pp. 57-62, June 2000.
[35] E.A. Rundensteiner, A.J. Lee, and A. Nica, “On Preserving Views in Evolving Environments,” Proc. Fourth Int'l Workshop Knowledge Representation Meets Databases (KRDB'97): Intelligent Access to Heterogeneous Information, pp. 13.1-13.11, Aug. 1997.
[36] Y.G. Ra and E.A. Rundensteiner, “A Transparent OO Schema Change Approach Using View Schema Evolution,” IEEE Int'l Conf. Data Eng., pp. 165-172, Mar. 1995.
[37] Y.G. Ra and E.A. Rundensteiner, “A Transparent Schema-Evolution System Based on Object-Oriented View Technology,” IEEE Trans. Knowledge and Data Eng., vol. 10, no. 4, July/Aug. 1998.
[38] A. Rajaraman, Y. Sagiv, and J.D. Ullman, “Answering Queries Using Templates With Binding Patterns,” Proc. ACM Symp. Principles of Database Systems, pp. 105-112, May 1995.
[39] A. Rajaraman and J.D. Ullman, “Integrating Information by Outerjoins and Full Disjunctions,” Proc. ACM Symp. Principles of Database Systems, pp. 238-248, 1996.
[40] D. Srivastava, S. Dar, H.V. Jagadish, and A.Y. Levy, “Answering Queries with Aggregation Using Views,” Proc. Int'l Conf. Very Large Data Bases, pp. 318-329, 1996.
[41] J. Ullman, Principles of Database and Knowledge-Base Systems, vol. 1. Computer Science Press, 1988.
[42] J. Widom, “Research Problems in Data Warehousing,” Proc. Int'l Conf. Information and Knowledge Management, pp. 25-30, Nov. 1995.
[43] Y. Zhuge, H. García-Molina, J. Hammer, and J. Widom, “View Maintenance in a Warehousing Environment,” Proc. SIGMOD, pp. 316-327, May 1995.
[44] X. Zhang and E.A. Rundensteiner, “Integrating the Maintenance and Synchronization of Data Warehouses Using a Cooperative Framework” Information Systems, vol. 27, no. 4, pp. 219-243, 2002.
[45] X. Zhang and E.A. Rundensteiner, “DyDa: Dynamic Data Warehouse Maintenance in a Fully Concurrent Environment,” Data Warehousing and Knowledge Discovery, Sept. 2000.

Index Terms:
Data warehouses, view maintenance, query rewriting, view adaptation, view synchronization, view definition language distributed, evolving information sources, and source evolution.
Citation:
Amy J. Lee, Anisoara Nica, Elke A. Rundensteiner, "The EVE Approach: View Synchronization in Dynamic Distributed Environments," IEEE Transactions on Knowledge and Data Engineering, vol. 14, no. 5, pp. 931-954, Sept.-Oct. 2002, doi:10.1109/TKDE.2002.1033766
Usage of this product signifies your acceptance of the Terms of Use.