36th Annual Hawaii International Conference on System Sciences (HICSS'03) - Track 4
Big Island, Hawaii
January 6-9, 2003
ISBN: 0-7695-1874-5
Udo Hahn, Freiburg University
Martin Honeck, Freiburg University Hospital
Stefan Schulz, Freiburg University Hospital
Document retrieval in languages with a rich and complex morphology — particularly in terms of derivation and (single-word) composition — suffers from serious performance degradation with the stemming-only query-term-to-text-word matching paradigm. We propose an alternative approach in which morphologically complex word forms are segmented into relevant subwords (such as stems, prefixes, suffixes), and subwords constitute the basic unit for indexing and retrieval. We evaluate our approach on a large biomedical document collection.
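The idea of indexing on subwords rather than stems can be illustrated with a minimal sketch. It is not the authors' implementation: the toy lexicon `SUBWORDS`, the fewest-pieces segmentation rule, and the helper names `segment`, `build_index`, and `search` are all assumptions made for illustration only.

```python
from functools import lru_cache

# Hypothetical toy lexicon; the actual subword dictionary is not given in the abstract.
SUBWORDS = {"gastr", "o", "enter", "itis", "hyper", "tens", "ion"}


def segment(word, lexicon=SUBWORDS):
    """Split a word into lexicon subwords, preferring the fewest pieces.

    Returns a list of subwords, or None if the word cannot be fully covered.
    """
    word = word.lower()

    @lru_cache(maxsize=None)
    def best(i):
        # Best segmentation of word[i:], as a tuple of subwords.
        if i == len(word):
            return ()
        candidates = []
        for j in range(i + 1, len(word) + 1):
            piece = word[i:j]
            if piece in lexicon:
                rest = best(j)
                if rest is not None:
                    candidates.append((piece,) + rest)
        return min(candidates, key=len) if candidates else None

    result = best(0)
    return list(result) if result is not None else None


def build_index(docs, lexicon=SUBWORDS):
    """Map each subword to the set of document ids that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for token in text.split():
            # Fall back to the whole token when no segmentation exists.
            for sub in segment(token, lexicon) or [token.lower()]:
                index.setdefault(sub, set()).add(doc_id)
    return index


def search(index, query, lexicon=SUBWORDS):
    """Return ids of documents containing every subword of the query."""
    subs = []
    for token in query.split():
        subs.extend(segment(token, lexicon) or [token.lower()])
    postings = [index.get(s, set()) for s in subs]
    return set.intersection(*postings) if postings else set()


docs = {1: "gastroenteritis", 2: "hypertension"}
index = build_index(docs)
hits = search(index, "enteritis")
```

In this sketch the query "enteritis" retrieves the document containing "gastroenteritis" because both share the subwords "enter" and "itis", a match that whole-word stemming would miss.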
Citation:
Udo Hahn, Martin Honeck, Stefan Schulz, "Subword-Based Text Retrieval," 36th Annual Hawaii International Conference on System Sciences (HICSS'03) - Track 4, vol. 4, p. 108a, 2003.