Fourth Annual ACIS International Conference on Computer and Information Science (ICIS'05)
Acquiring Dominant Compound Terms to Build Korean Domain Knowledge Bases
Jeju Island, South Korea
July 14-16, 2005
ISBN: 0-7695-2296-3
Compound terms should be ranked well to reduce the labor of building domain knowledge bases such as term dictionaries and thesauri. Terms that have been dominant in recent years are especially valuable in terms of coverage and reference. We extract Korean compound terms by linguistic filtering with a part-of-speech filter and four combination rules. Domain seed terms are then used to select related terms from the extracted term list. Term ranking, which considers the dominance trend of each term across several years of data, assigns a term dominance value to each related term. Experimental results show that our ranking scheme distributes the extracted terms more adequately than term frequency ordering, and that it reduces the effort of building domain knowledge bases by clustering terms into three groups: growing, declining, and steady.
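The abstract does not give the formula for the term dominance value, so the following is only a minimal sketch of the general idea of trend-based grouping, not the authors' method. It assumes that each term has one frequency count per year, that the trend can be approximated by the least-squares slope of the term's yearly share of its own total count, and that a hypothetical `tolerance` threshold separates steady terms from growing or declining ones. The function names, the threshold value, and the toy counts are all illustrative assumptions.

```python
from typing import Dict, List


def trend_slope(values: List[float]) -> float:
    """Least-squares slope of values against the year index 0..n-1."""
    n = len(values)
    if n < 2:
        return 0.0
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    num = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(values))
    den = sum((x - mean_x) ** 2 for x in range(n))
    return num / den


def classify_term(yearly_counts: List[int], tolerance: float = 0.02) -> str:
    """Assign a term to 'growing', 'declining', or 'steady'.

    Assumption: the trend is measured on each year's share of the term's
    total count, so frequent and rare terms are compared on the same scale.
    The tolerance value is hypothetical, not taken from the paper.
    """
    total = sum(yearly_counts)
    if total == 0:
        return "steady"
    shares = [count / total for count in yearly_counts]
    slope = trend_slope(shares)
    if slope > tolerance:
        return "growing"
    if slope < -tolerance:
        return "declining"
    return "steady"


def group_terms(counts_by_term: Dict[str, List[int]]) -> Dict[str, List[str]]:
    """Cluster terms into the three groups named in the abstract."""
    groups: Dict[str, List[str]] = {"growing": [], "declining": [], "steady": []}
    for term, yearly_counts in counts_by_term.items():
        groups[classify_term(yearly_counts)].append(term)
    return groups


# Toy counts over five consecutive years (made up for illustration only).
example = {
    "term_a": [1, 2, 3, 4, 10],
    "term_b": [10, 4, 3, 2, 1],
    "term_c": [5, 6, 5, 4, 5],
}
groups = group_terms(example)
```

Normalizing by each term's own total is one design choice among several; a plain frequency ordering would instead rank `term_a` and `term_b` identically, since both occur 20 times in the toy data.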
Index Terms:
Term Extraction, Term Ranking, Term Dominance Trend, Term Dominance Value
Citation:
Hanmin Jung, HeeKwan Koo, Byeong-Hee Lee, Won-Kyung Sung, "Acquiring Dominant Compound Terms to Build Korean Domain Knowledge Bases," Fourth Annual ACIS International Conference on Computer and Information Science (ICIS'05), pp. 2-7, 2005