Issue No.02 - March/April (2003 vol.15)
<p><b>Abstract</b>—The availability of automatic tools for inferring semantics of database schemes is useful to solve several database design problems such as, that of obtaining Cooperative Information Systems or Data Warehouses from large sets of data sources. In this context, a main problem is to single out similarities or dissimilarities among scheme objects (interscheme properties). This paper presents graph-based techniques for a uniform derivation of interscheme properties including synonymies, homonymies, type conflicts, and subscheme similarities. These techniques are characterized by a common core: the computation of maximum weight matchings on some bipartite weighted graphs derived using a suitable metrics to measure semantic closeness of objects. The techniques have been implemented in a system prototype. Several experiments conducted with it, and (in part) accounted for in the paper, confirmed the effectiveness of our approach.</p>
Synonymies, homonymies, type conflicts, subscheme similarities, derivation of database semantics, heterogeneous databases, database interoperability.
Domenico Saccà, Luigi Palopoli, Domenico Ursino, "Uniform Techniques for Deriving Similarities of Objects and Subschemes in Heterogeneous Databases", IEEE Transactions on Knowledge & Data Engineering, vol.15, no. 2, pp. 271-294, March/April 2003, doi:10.1109/TKDE.2003.1185834