Automatic Disambiguation of Latin Abbreviations in Early Modern Texts for Humanities Digital Libraries
Houston, Texas USA
May 27, 2003 to May 31, 2003
Jeffrey A. Rydberg-Cox , University of Missouri at Kansas City
Early modern books written in Latin contain many abbreviations of common words that are derived from earlier manuscript practice. While these abbreviations are usually easily deciphered by a reader well-versed in Latin, they pose technical problems for full text digitization: they are difficult to OCR or have typed and — if they are not expanded correctly — they limit the effectiveness of information retrieval and reading support tools in the digital library. In this paper, I will describe a method for the automatic expansion and disambiguation of these abbreviations.
Content, Collection Development, Document Processing, Algorithms, Experimentation, Languages, Digitization, Tagging Early Modern Texts, History of Science
Jeffrey A. Rydberg-Cox, "Automatic Disambiguation of Latin Abbreviations in Early Modern Texts for Humanities Digital Libraries", JCDL, 2003, Digital Libraries, Joint Conference on, Digital Libraries, Joint Conference on 2003, pp. 372, doi:10.1109/JCDL.2003.1204892