Computer Science and Information Engineering, World Congress on (2009)
Los Angeles, California USA
Mar. 31, 2009 to Apr. 2, 2009
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CSIE.2009.886
A large number of Deep Web data sources are only accessible through their query interfaces. For any domain of interest, there may be many such sources with varied query capabilities and content coverage.To obtain mass valuable information in deep Web, we need to integrate large heterogeneous information. Schema matching is a critical problem in the integration process. This paper propose a new holistic schema matching method based on data mining, named as Correlated-clustering, which mines positively correlated attributes to form potential attribute groups, and finds synonym attributes by clustering. we design experiments to implement mentioned algorithms and technology. Experimental results testify that our solution achieves accurately and effectively.
C. Zhiming, Z. Wenyun, F. Yuchen, Z. Chao, L. Quan and X. Yunlong, "Correlated-Clustering Frame: A Holistic Method of Deep Web Schema Matching Based on Data Mining," 2009 WRI World Congress on Computer Science and Information Engineering, CSIE(CSIE), Los Angeles, CA, 2009, pp. 528-533.