The Community for Technology Leaders
RSS Icon
Subscribe
Big Island, HI, USA
Jan. 6, 2003 to Jan. 9, 2003
ISBN: 0-7695-1874-5
pp: 108a
Udo Hahn , Freiburg University
Martin Honeck , Freiburg University Hospital
Stefan Schulz , Freiburg University Hospital
ABSTRACT
Document retrieval in languages with a rich and complex morphology — particularly in terms of derivation and (single-word) composition — suffers from serious performance degradation with the stemming-only query-term-to-text-word matching paradigm. We propose an alternative approach in which morphologically complex word forms are segmented into relevant subwords (such as stems, prefixes, suffixes), and subwords constitute the basic unit for indexing and retrieval. We evaluate our approach on a large biomedical document collection.
INDEX TERMS
null
CITATION
Udo Hahn, Martin Honeck, Stefan Schulz, "Subword-Based Text Retrieval", HICSS, 2003, 36th Hawaii International Conference on Systems Sciences, 36th Hawaii International Conference on Systems Sciences 2003, pp. 108a, doi:10.1109/HICSS.2003.1174249
16 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool