loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
16th International Conference on Data Engineering (ICDE'00)
Practical Lineage Tracing in Data Warehouses
San Diego, California
February 28-March 03
ISBN: 0-7695-0506-6
Yingwe Cui, Computer Science Department, Stanford University
Jennifer Widom, Computer Science Department, Stanford University
We consider the view data lineage problem in a warehousing environment: For a given data item in a materialized warehouse view, we want to identify the set of source data items that produced the view item. We formalize the problem, and we present a lineage tracing algorithm for relational views with aggregation. Based on our tracing algorithm, we propose a number of schemes for storing auxiliary views that enable consistent and efficient lineage tracing in a multi-source data warehouse.We report on a performance study of the various schemes, identifying which schemes perform best in which settings. Based on our results, we have implemented a lineage tracing package in the WHIPS data warehousing system prototype at Stanford. With this package, users can select view tuples of interest, then efficiently drill through to examine the exact source tuples that produced the view tuples of interest.
Index Terms:
data warehouse, data lineage, auxiliary view
Citation:
Yingwe Cui, Jennifer Widom, "Practical Lineage Tracing in Data Warehouses," icde, pp.367, 16th International Conference on Data Engineering (ICDE'00), 2000
Usage of this product signifies your acceptance of the Terms of Use.