5th International Conference on Intelligent Systems Design and Applications (ISDA'05) Text Segmentation in Polish Wroclaw, Poland September 08-September 10 ISBN: 0-7695-2286-6
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ISDA.2005.89
In the paper a great importance of text segmentation in natural language engineering and in artificial intelligence systems has been pointed out. It has been shown that in Polish all punctuation marks that end sentences have also other functions in sentences. In this context various approaches to sentence boundary disambiguation have been presented. Taking features of Polish into consideration, text tokenization has been analysed. The direction of empirical research on Polish texts segmentation based on the analysis contained in this paper has been drawn. Also the list of Polish abbreviations that have the same spelling as some common words has been presented.
Citation:
Pawel P. Mazur, "Text Segmentation in Polish," isda, pp.43-48, 5th International Conference on Intelligent Systems Design and Applications (ISDA'05), 2005 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||