The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.03 - March (2012 vol.24)
pp: 506-519
Gao Cong , Nanyang Technological University, Singapore
Wenfei Fan , University of Ediburgh, Edinburgh
Floris Geerts , University of Edinburgh, Edinburgh
Jianzhong Li , Harbin Institue of Technology, Heilongjiang
Jizhou Luo , Harbin Institute of Technology, Heilongjiang
ABSTRACT
This paper investigates three problems identified in [1] for annotation propagation, namely, the view side-effect, source side-effect, and annotation placement problems. Given annotations entered for a tuple or an attribute in a view, these problems ask what tuples or attributes in the source have to be annotated to produce the view annotations. As observed in [1], these problems are fundamental not only for data provenance but also for the management of view updates. For an annotation attached to a single existing tuple in a view, it has been shown that these problems are often intractable even for views defined in terms of simple SPJU queries [1]. We revisit these problems by considering several dichotomies: 1) views defined in various subclasses of SPJU, versus SPJU views under a practical key preserving condition; 2) annotations attached to existing tuples in a view versus annotations on tuples to be inserted into the view; and 3) a single-tuple annotation versus a group of annotations. We provide a complete picture of intractability and tractability for the three problems in all these settings. We show that key preserving views often simplify the propagation analysis. Indeed, some problems become tractable for certain key preserving views, as opposed to the intractability of their counterparts that are not key preserving. However, group annotations often make the analysis harder. In addition, the problems have quite diverse complexity when annotations are attached to existing tuples in a view and when they are entered for tuples to be inserted into the view.
INDEX TERMS
Annotation, view updates, view maintenance, SPJU queries.
CITATION
Gao Cong, Wenfei Fan, Floris Geerts, Jianzhong Li, Jizhou Luo, "On the Complexity of View Update Analysis and Its Application to Annotation Propagation", IEEE Transactions on Knowledge & Data Engineering, vol.24, no. 3, pp. 506-519, March 2012, doi:10.1109/TKDE.2011.27
REFERENCES
[1] P. Buneman, S. Khanna, and W. Tan, "On Propagation of Deletion and Annotation through Views," Proc. ACM SIGMOD-SIGACT-SIGART Symp. Principles of Database Systems (PODS), pp. 150-158, 2002.
[2] P. Buneman, J. Cheney, W. Tan, and S. Vansummeren, "Curated Databases," Proc. ACM SIGMOD-SIGACT-SIGART Symp. Principles of Database Systems (PODS), pp. 1-12, 2008.
[3] J. Cheney, L. Chiticariu, and W.C. Tan, "Provenance in Databases: Why, How, and Where," Foundations and Trends in Databases, vol. 1, no. 4, pp. 379-474, 2009.
[4] W. Gatterbauer, M. Balazinska, N. Khoussainova, and D. Suciu, "Believe It or Not: Adding Belief Annotations to Databases," Proc. VLDB Endowment, vol. 2, no. 1, pp. 1-12, 2009.
[5] P. Buneman, S. Khanna, and W. Tan, "Why and Where: A Characterization of Data Provenance," Proc. Int'l Conf. Database Theory (ICDT), pp. 316-330, 2001.
[6] M.Y. Eltabakh, W.G. Aref, A.K. Elmagarmid, M. Ouzzani, and Y.N. Silva, "Supporting Annotations on Relations," Proc. Int'l Conf. Extending Database Technology (EDBT), pp. 379-390, 2009.
[7] J. Huang, T. Chen, A. Doan, and J.F. Naughton, "On the Provenance of Non-Answers to Queries over Extracted Data," Proc. VLDB Endowment, vol. 1, no. 1, pp. 736-747, 2008.
[8] D. Bhagwat, L. Chiticariu, G. Vijayvargiya, and W. Tan, "An Annotation Management System for Relational Databases," VLDB J., vol. 14, no. 4, pp. 373-396, 2005.
[9] Y. Cui, J. Widom, and J.L. Wiener, "Tracing the Lineage of View Data in a Warehousing Environment," ACM Trans. Database Systems, vol. 25, no. 2, pp. 179-227, 2000.
[10] Y. Cui and J. Widom, "Run-Time Translation of View Tuple Deletions Using Data Lineage," technical report, Stanford Univ., 2001.
[11] E. Rahm and H.H. Do, "Data Cleaning: Problems and Current Approaches," IEEE Data Eng. Bull., vol. 23, no. 4, pp. 3-13, Dec. 2000.
[12] W.C. Tan, "Containment of Relational Queries with Annotation Propagation," Proc. Int'l Conf. Data Base Programming Languages (DBPL), pp. 37-53, 2003.
[13] Annotation for the Semantic Web (Frontiers in Artificial Intelligence and Applications), S. Handschuh and S. Staab, eds. IOS Press, 2003.
[14] M. Agosti, N. Ferro, I. Frommholz, and U. Thiel, "Annotations in Digital Libraries and Collaboratories Facets, Models and Usage," Proc. European Conf. Research and Advanced Technology for Digital Libraries (ECDL), pp. 244-255, 2004.
[15] L. Chiticariu, W. Tan, and G. Vijayvargiya, "DBNotes: A Post-It System for Relational Databases Based on Provenance," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 942-944, 2005.
[16] F. Geerts, A. Kementsietsidis, and D. Milano, "$i$ MONDRIAN: A Visual Tool to Annotate and Query Scientific Databases," Proc. Int'l Conf. Extending Database Technology (EDBT), pp. 1168-1171, 2006.
[17] S. Abiteboul, R. Hull, and V. Vianu, Foundations of Databases. Addison-Wesley, 1995.
[18] U. Dayal and P.A. Bernstein, "On the Correct Translation of Update Operations on Relational Views," ACM Trans. Database Systems, vol. 7, no. 3, pp. 381-416, 1982.
[19] A. Keller, "Algorithms for Translating View Updates to Database Updates for Views Involving Selections, Projections, and Joins," Proc. ACM SIGMOD-SIGACT-SIGART Symp. Principles of Database Systems (PODS), pp. 154-163, 1985.
[20] S.S. Cosmadakis and C.H. Papadimitriou, "Updates of Relational Views," Proc. ACM SIGMOD-SIGACT-SIGART Symp. Principles of Database Systems (PODS), pp. 317-331, 1983.
[21] J. Lechtenborger and G. Vossen, "On the Computation of Relational View Complements," ACM Trans. Database Systems, vol. 28, no. 2, pp. 175-208, 2003.
[22] F. Bancilhon and N. Spyratos, "Update Semantics of Relational Views," ACM Trans. Database Systems, vol. 6, no. 4, pp. 557-575, 1981.
[23] G. Cong, W. Fan, and F. Geerts, "Annotation Propagation Revisited for Key Preserving View," Proc. Int'l Conf. Information and Knowledge Management (CIKM), pp. 632-641, 2006.
[24] F. Geerts, A. Kementsietsidis, and D. Milano, "MONDRIAN: Annotating and Querying Databases through Colors and Blocks," Proc. Int'l Conf. Data Eng. (ICDE), 2006.
[25] F. Geerts and J. Van den Bussche, "Relational Completeness of Query Languages for Annotated Databases," J. Computer and System Sciences, vol. 77, pp. 491-504, 2011.
[26] P. Buneman, J. Cheney, and S. Vansummeren, "On the Expressiveness of Implicit Provenance in Query and Update Languages," ACM Trans. Database Systems, vol. 33, no. 4, pp. 1-47, 2008.
[27] T.J. Green, G. Karvounarakis, and V. Tannen, "Provenance Semirings," Proc. ACM SIGMOD-SIGACT-SIGART Symp. Principles of Database Systems (PODS), pp. 31-40, 2007.
[28] Y.R. Wang and S.E. Madnick, "A Polygen Model for Heterogeneous Database Systems: The Source Tagging Perspective," Proc. Int'l Conf. Very Large Databases (VLDB), pp. 519-538, 1990.
[29] P. Buneman, A. Chapman, and J. Cheney, "Provenance Management in Curated Databases," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 539-550, 2006.
[30] A. Bohannon, B. Pierce, and J.A. Vaughan, "Relational Lenses: A Language for Updateable Views," Proc. ACM SIGMOD-SIGACT-SIGART Symp. Principles of Database Systems (PODS), pp. 338-347, 2006.
[31] B. Choi, G. Cong, W. Fan, and S.D. Viglas, "Updating Recursive XML Views of Relations," J. Computer Science and Technology, vol. 23, no. 4, pp. 516-537, 2008.
[32] "IBM DB2 Universal Database SQL Reference," IBM, http://www.ibm.com/software/datadb2/, 2011.
[33] "SQL Reference," Oracle, http://www.oracle.com/technology/documentation database10g.html, 2011.
[34] "MSDN Library," SQL Server, http://msdn.microsoft.com library, 2011.
[35] M. Garey and D. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness. WH Freeman and Co., 1979.
[36] C.H Papadimitriou, Computational Complexity. Addison-Wesley, 1994.
37 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool