Fifth IEEE International Workshop on Computer Architectures for Machine Perception (CAMP'00)
Handling Artifacts in Digitally Reproduced Documents
Padova, Italy
September 11-13, 2000
ISBN: 0-7695-0740-9
L. Cinque, Dept. of Inf. Sci., Rome Univ., Italy
The analysis of scanned documents is important in the construction of digital libraries and paperless offices. One significant challenge is coping with artifacts of photocopying and scanning. We present a series of simple techniques for handling these difficulties. Using 125 images from the University of Washington scanned documents database, we demonstrate the effectiveness of these methods in preparing the images for segmentation by a multiresolution algorithm.
Index Terms:
digital libraries; scanned documents; paperless offices; artifacts; photocopying; scanning; scanned documents database; segmentation; multiresolution algorithm
Citation:
L. Cinque, S. Levialdi, L. Lombardi, S. Tanimoto, "Handling Artifacts in Digitally Reproduced Documents," Fifth IEEE International Workshop on Computer Architectures for Machine Perception (CAMP'00), p. 340, 2000