16th International Conference on Pattern Recognition (ICPR'02) - Volume 3
Newspaper Headlines Extraction from Microfilm Images
Quebec City, QC, Canada, August 11-August 15
ISBN: 0-7695-1695-X
Automatic indexing is important for a digital library that provides digitized images of old documents together with their electronic text. As an essential step in creating such a system, this paper discusses the extraction of headlines from old newspaper microfilms. Most research on document layout analysis assumes relatively clean images. However, microfilm images of old newspapers present a challenge: they are usually insufficiently illuminated and considerably dirty. Because conventional threshold selection techniques are inadequate for these kinds of images, we propose a new method for separating characters from the noisy background. A Run Length Smearing Algorithm (RLSA) is applied in the headline extraction. Experiments show that our approach improves the recall, precision and combined rates.
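As a rough illustration of the run length smearing idea mentioned in the abstract, the sketch below implements the classic formulation of RLSA on a binary image: background runs no longer than a threshold are filled with ink, once along rows and once along columns, and the two results are combined with a logical AND. This is a minimal sketch, not the authors' implementation. The function names, the 1 = ink / 0 = background convention, the choice to fill only gaps that lie between two ink pixels, and the threshold values are all assumptions; the abstract does not state which RLSA variant or parameters the paper uses.

```python
def smear_row(row, threshold):
    """Fill background gaps of length <= threshold that lie between ink pixels.

    Convention (an assumption of this sketch): 1 = ink, 0 = background.
    """
    out = list(row)
    last_ink = None  # index of the most recent ink pixel seen so far
    for i, pixel in enumerate(row):
        if pixel == 1:
            if last_ink is not None:
                gap = i - last_ink - 1
                if 0 < gap <= threshold:
                    # Short gap: smear it into ink.
                    for j in range(last_ink + 1, i):
                        out[j] = 1
            last_ink = i
    return out


def rlsa_horizontal(image, threshold):
    """Apply the smearing independently to every row."""
    return [smear_row(row, threshold) for row in image]


def rlsa_vertical(image, threshold):
    """Apply the smearing to every column by transposing, smearing, and transposing back."""
    columns = [list(col) for col in zip(*image)]
    smeared = [smear_row(col, threshold) for col in columns]
    return [list(row) for row in zip(*smeared)]


def rlsa(image, h_threshold, v_threshold):
    """Combine horizontal and vertical smearing with a pixel-wise AND."""
    horiz = rlsa_horizontal(image, h_threshold)
    vert = rlsa_vertical(image, v_threshold)
    return [[a & b for a, b in zip(h_row, v_row)]
            for h_row, v_row in zip(horiz, vert)]
```

With a threshold of 2, a row such as `[1, 0, 0, 1, 0, 0, 0, 0, 1]` has its first gap (length 2) filled and its second gap (length 4) left alone. In a layout-analysis setting, the connected blocks that result from smearing would then be measured (for example by height) to pick out headline candidates; that selection step is not shown here because the abstract does not describe it.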
Citation:
Qing Hong Liu, Chew Lim Tan, "Newspaper Headlines Extraction from Microfilm Images," icpr, vol. 3, pp.30208, 16th International Conference on Pattern Recognition (ICPR'02) - Volume 3, 2002