2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) (2017)
Kansas City, MO, USA
Nov. 13, 2017 to Nov. 16, 2017
Zhe He , School of Information, Florida State University, Tallahassee, FL
Yehoshua Perl , Department of Computer Science, New Jersey Institute of Tehnology, Newark, NJ
Gai Elhanan , Department of Computer Science, New Jersey Institute of Tehnology, Newark, NJ
Yan Chen , Department of Computer, Information Systems, BMCC, CUNY, New York, NJ
James Geller , Department of Computer Science, New Jersey Institute of Tehnology, Newark, NJ
Jiang Bian , Department of Health Outcomes and Policy, University of Florida, Gainesville, FL
The Unified Medical Language System (UMLS) is an important terminological system. By the policy of its curators, each concept of the UMLS should be assigned the most specific Semantic Types (STs) in the UMLS Semantic Network (SN). Hence, the Semantic Types of most UMLS concepts are assigned at or near the bottom (leaves) of the UMLS Semantic Network. While most ST assignments are correct, some errors do occur. Therefore, Quality Assurance efforts of UMLS curators for ST assignments should concentrate on automatically detected sets of UMLS concepts with higher error rates than random sets. In this paper, we investigate the assignments of top-level semantic types in the UMLS semantic network to concepts, identify potential erroneous assignments, define four categories of errors, and thus provide assistance to curators of the UMLS to avoid these assignments errors. Human experts analyzed samples of concepts assigned 10 of the top-level semantic types and categorized the erroneous ST assignments into these four logical categories. Two thirds of the concepts assigned these 10 top-level semantic types are erroneous. Our results demonstrate that reviewing top-level semantic type assignments to concepts provides an effective way for UMLS quality assurance, comparing to reviewing a random selection of semantic type assignments.
Unified modeling language, Semantics, Terminology, Quality assurance, Chemicals, Ontologies, Diseases
Z. He, Y. Perl, G. Elhanan, Y. Chen, J. Geller and J. Bian, "Auditing the assignments of top-level semantic types in the UMLS semantic network to UMLS concepts," 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Kansas City, MO, USA, 2017, pp. 1262-1269.