Issue No.07 - July (2012 vol.24)
Zhengrui Jiang , Iowa State University, Ames
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2011.75
One of the major challenges of data integration is to resolve conflicting numerical attribute values caused by data heterogeneity. In addressing this problem, existing approaches proposed in prior literature often ignore such data inconsistencies or resolve them in an ad hoc manner. In this study, we propose a decision-theoretical framework that resolves numerical value conflicts in a systematic manner. The framework takes into consideration the consequences of incorrect numerical values and selects the value that minimizes the expected cost of errors for all data application problems under consideration. Experimental results show that significant savings can be achieved by adopting the proposed framework instead of ad hoc approaches.
Database integration, heterogeneous databases, data heterogeneity, numerical value conflicts, probabilistic databases, Type I, Type II, and misrepresentation errors.
Zhengrui Jiang, "A Decision-Theoretic Framework for Numerical Attribute Value Reconciliation", IEEE Transactions on Knowledge & Data Engineering, vol.24, no. 7, pp. 1153-1169, July 2012, doi:10.1109/TKDE.2011.75