The Community for Technology Leaders
2013 IEEE 29th International Conference on Data Engineering (ICDE) (2013)
Brisbane, Australia Australia
Apr. 8, 2013 to Apr. 12, 2013
ISSN: 1063-6382
ISBN: 978-1-4673-4909-3
pp: 194-205
Taesung Lee , POSTECH, Pohang, South Korea
Zhongyuan Wang , Sch. of Inf., Renmin Univ. of China, Beijing, China
Haixun Wang , Microsoft Res. Asia, Beijing, China
Seung-won Hwang , POSTECH, Pohang, South Korea
ABSTRACT
Knowledge bases, which consist of concepts, entities, attributes and relations, are increasingly important in a wide range of applications. We argue that knowledge about attributes (of concepts or entities) plays a critical role in inferencing. In this paper, we propose methods to derive attributes for millions of concepts and we quantify the typicality of the attributes with regard to their corresponding concepts. We employ multiple data sources such as web documents, search logs, and existing knowledge bases, and we derive typicality scores for attributes by aggregating different distributions derived from different sources using different methods. To the best of our knowledge, ours is the first approach to integrate concept- and instance-based patterns into probabilistic typicality scores that scale to broad concept space. We have conducted extensive experiments to show the effectiveness of our approach.
INDEX TERMS
Knowledge based systems, Companies, Probabilistic logic, Sociology, Statistics, Data mining, Syntactics
CITATION
Taesung Lee, Zhongyuan Wang, Haixun Wang, Seung-won Hwang, "Attribute extraction and scoring: A probabilistic approach", 2013 IEEE 29th International Conference on Data Engineering (ICDE), vol. 00, no. , pp. 194-205, 2013, doi:10.1109/ICDE.2013.6544825
174 ms
(Ver 3.3 (11022016))