Los Angeles, CA
March 31, 2009 to April 2, 2009
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CSIE.2009.886
A large number of Deep Web data sources are accessible only through their query interfaces. For any domain of interest, there may be many such sources with varied query capabilities and content coverage. To obtain valuable information from the Deep Web at scale, large amounts of heterogeneous information must be integrated, and schema matching is a critical problem in this integration process. This paper proposes a new holistic schema matching method based on data mining, named Correlated-clustering, which mines positively correlated attributes to form potential attribute groups and finds synonym attributes by clustering. We design experiments to implement the algorithms and techniques described. Experimental results indicate that our solution is accurate and effective.
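As a rough illustration of the two steps named in the abstract (grouping positively correlated attributes, then clustering synonym attributes), the sketch below works on toy query-interface schemas. It is not the paper's algorithm: the Jaccard measure, the thresholds `min_corr` and `min_sim`, the single-link merging, the function names, and the sample schemas are all assumptions chosen for illustration.

```python
from itertools import combinations

def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def occurrences(schemas):
    """Map each attribute to the set of schema indices that contain it."""
    occ = {}
    for i, schema in enumerate(schemas):
        for attr in schema:
            occ.setdefault(attr, set()).add(i)
    return occ

def correlated_groups(schemas, min_corr=0.8):
    """Attribute pairs that co-occur in nearly the same schemas (assumed measure)."""
    occ = occurrences(schemas)
    return [
        (a, b)
        for a, b in combinations(sorted(occ), 2)
        if jaccard(occ[a], occ[b]) >= min_corr
    ]

def synonym_clusters(schemas, min_sim=0.8):
    """Single-link clusters of attributes that never co-occur but share context."""
    occ = occurrences(schemas)
    # Context of an attribute: every other attribute seen alongside it.
    context = {a: set().union(*(schemas[i] for i in occ[a])) - {a} for a in occ}
    clusters = {a: {a} for a in occ}
    for a, b in combinations(sorted(occ), 2):
        if occ[a] & occ[b]:
            continue  # attributes used together are unlikely to be synonyms
        if jaccard(context[a], context[b]) >= min_sim:
            merged = clusters[a] | clusters[b]
            for attr in merged:
                clusters[attr] = merged
    unique = {frozenset(c) for c in clusters.values() if len(c) > 1}
    return sorted(sorted(c) for c in unique)

# Hypothetical book-search interfaces, one attribute set per source.
schemas = [
    {"author", "title", "isbn"},
    {"writer", "title", "isbn"},
    {"first name", "last name", "title"},
    {"first name", "last name", "title", "isbn"},
]
print(correlated_groups(schemas))  # [('first name', 'last name')]
print(synonym_clusters(schemas))   # [['author', 'writer']]
```

In this toy data, "first name" and "last name" always appear together and so form a group, while "author" and "writer" never appear together yet share the same neighbouring attributes, which is the signal the sketch treats as synonymy.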
Fu Yuchen, Liu Quan, Xu Yunlong, Zhang Chao, Zhou Wenyun, Cui Zhiming, "Correlated-Clustering Frame: A Holistic Method of Deep Web Schema Matching Based on Data Mining", 2009 WRI World Congress on Computer Science and Information Engineering (CSIE 2009), pp. 528-533, doi:10.1109/CSIE.2009.886