Identifying approximately duplicate database records that refer to the same entity is essential for information integration. The authors review traditional approaches to solving this problem and present their recent experimental results on comparing, combining, and learning textual similarity measures for name matching.
Index Terms:
database integration, text mining, machine learning, similarity measures
Citation:
Mikhail Bilenko, Raymond Mooney, William Cohen, Pradeep Ravikumar, Stephen Fienberg, "Adaptive Name Matching in Information Integration," IEEE Intelligent Systems, vol. 18, no. 5, pp. 16-23, Sep./Oct. 2003, doi:10.1109/MIS.2003.1234765 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||