Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06)
Knowledge Discovery across Documents through Concept Chain Queries
Hong Kong, China
December 18-22, 2006
ISBN: 0-7695-2702-7
This paper focuses on detecting links between two concepts (e.g., two persons) across text documents. We interpret such a query as finding the most meaningful evidence trail across documents that connects the two concepts, and we propose an efficient algorithm for this task. The algorithm builds on the idea of hypothesis generation originated by Swanson, known as "complementary structures in disjoint literatures" (CSD). We adapt the technique by (i) developing an alternate method of generating semantic profiles and (ii) extending it to generate concept chains. We evaluate the approach on a counterterrorism corpus, and the results demonstrate the effectiveness of our algorithm.
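To make the notion of a concept chain concrete, the following is a minimal illustrative sketch and not the algorithm proposed in the paper. It assumes each document has already been reduced to a set of concept labels, links two concepts whenever they co-occur in a document, and uses a plain breadth-first search to find a shortest chain between two query concepts. The names `build_cooccurrence`, `concept_chain`, and the toy document set are hypothetical. The semantic profiles and the ranking of chains by meaningfulness described above are not modeled here.

```python
from collections import defaultdict, deque


def build_cooccurrence(docs):
    """Map each concept to the set of concepts it shares a document with."""
    graph = defaultdict(set)
    for concepts in docs.values():
        for concept in concepts:
            graph[concept].update(concepts - {concept})
    return graph


def concept_chain(docs, start, goal):
    """Return a shortest chain of co-occurring concepts from start to goal.

    Assumption: unweighted breadth-first search stands in for the paper's
    ranking of evidence trails. Returns None if no chain exists.
    """
    graph = build_cooccurrence(docs)
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        # Sort neighbours so the result is deterministic.
        for neighbour in sorted(graph[path[-1]]):
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append(path + [neighbour])
    return None


# Hypothetical toy corpus: no single document mentions both persons.
docs = {
    "d1": {"person_a", "organization_x"},
    "d2": {"organization_x", "city_y"},
    "d3": {"city_y", "person_b"},
}
chain = concept_chain(docs, "person_a", "person_b")
print(chain)
```

In this toy corpus the two persons never appear in the same document, so any link between them has to pass through intermediate concepts found in different documents, which is the situation the CSD idea addresses.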