loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
9th International Database Engineering & Application Symposium (IDEAS'05)
Evaluating and Improving Integration Quality for Heterogeneous Data Sources Using Statistical Analysis
Montreal, Canada
July 25-July 27
ISBN: 0-7695-2404-4
Evgeniya Altareva, University of Düsseldorf
Stefan Conrad, University of Düsseldorf
This paper considers the problem of integrating heterogeneous semi-structured data sources with the purpose of estimating integration quality (IQ). Integration of such data sources leads to results with unpredictable trustworthiness and none of the existing methods is capable of accounting for the uncertainty which is accumulated over all of the integration steps and which affects integration quality. To compute the uncertainties we suggest using a well-established statistical method Latent Class Analysis (LCA). This method allows to analyze the influence of the latent factors associated with the real-world entities on the set of data. We show on examples how the proposed approach can be used for evaluating and improving IQ giving an important tool to the users concerned with the data?s trustworthiness.
Citation:
Evgeniya Altareva, Stefan Conrad, "Evaluating and Improving Integration Quality for Heterogeneous Data Sources Using Statistical Analysis," ideas, pp.406-414, 9th International Database Engineering & Application Symposium (IDEAS'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.