Using Object Deputy Model to Prepare Data for Data Warehousing
September 2005 (vol. 17 no. 9)
pp. 1274-1288
Providing integrated access to multiple, distributed, heterogeneous databases and other information sources has become one of the leading issues in database research and industry. One of the most effective approaches is to extract and integrate the information of interest from each source in advance and store it in a centralized repository (known as a data warehouse). When a query is posed, it is evaluated directly at the warehouse without accessing the original information sources. One technique this approach uses to improve the efficiency of query processing is the materialized view. Materialized views are used in data warehouses, and various methods for them have been developed for relational databases. In this paper, we first discuss an object deputy approach to realizing materialized object views for data warehouses, which can also incorporate object-oriented databases. A framework has been developed in Smalltalk to prepare data for data warehousing, in which an object deputy model and database connecting tools have been implemented. As evidenced by our empirical study, the object deputy model can provide an easy-to-use way to resolve inconsistencies and conflicts while preparing data for data warehousing.
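The warehouse approach described in the abstract can be illustrated with a minimal sketch. It is a generic materialized view in Python, not the paper's object deputy model or its Smalltalk framework. The names `Source`, `MaterializedView`, `normalize`, and the "first source wins" duplicate rule are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Source:
    """A hypothetical information source holding rows as dictionaries."""
    name: str
    rows: List[dict]
    reads: int = 0  # counts how often the source itself is accessed

    def scan(self) -> List[dict]:
        self.reads += 1
        return list(self.rows)


@dataclass
class MaterializedView:
    """Stores integrated rows in advance so queries never touch the sources."""
    sources: List[Source]
    # Per-source mapping to a common schema (assumed conflict-resolution hook).
    normalize: Dict[str, Callable[[dict], dict]]
    rows: List[dict] = field(default_factory=list)

    def refresh(self) -> None:
        merged: Dict[int, dict] = {}
        for source in self.sources:
            for row in source.scan():
                unified = self.normalize[source.name](row)
                # Duplicate handling (assumed rule): the first source seen wins.
                merged.setdefault(unified["id"], unified)
        self.rows = list(merged.values())

    def query(self, predicate: Callable[[dict], bool]) -> List[dict]:
        # Evaluated against the stored copy only; no source is scanned here.
        return [row for row in self.rows if predicate(row)]


# Two sources that disagree on attribute name and unit (cents vs. currency units).
sales_eu = Source("eu", [{"id": 1, "price_cents": 1250}, {"id": 2, "price_cents": 900}])
sales_us = Source("us", [{"id": 2, "price": 9.0}, {"id": 3, "price": 20.0}])

view = MaterializedView(
    sources=[sales_eu, sales_us],
    normalize={
        "eu": lambda r: {"id": r["id"], "price": r["price_cents"] / 100},
        "us": lambda r: {"id": r["id"], "price": r["price"]},
    },
)
view.refresh()
expensive = view.query(lambda r: r["price"] > 10)
```

In this sketch each source is scanned once, during `refresh`, and later queries run against the stored rows alone. That is the efficiency argument the abstract makes for materialized views.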

Index Terms:
Data preparation, data warehousing, data fusion/integration, object deputy model, conflict resolution, duplicate handling.
Citation:
Zhiyong Peng, Qing Li, Ling Feng, Xuhui Li, Junqiang Liu, "Using Object Deputy Model to Prepare Data for Data Warehousing," IEEE Transactions on Knowledge and Data Engineering, vol. 17, no. 9, pp. 1274-1288, Sept. 2005, doi:10.1109/TKDE.2005.154