Second International Conference on Document Image Analysis for Libraries (DIAL'06)
The Case of the Digitized Works at a National Digital Library
Lyon, France
April 27-April 28
ISBN: 0-7695-2531-8
Jose Borbinha, INESC-ID - Instituto de Engenharia de Sistemas e Computadores, Lisboa, Portugal
Joao Gil, INESC-ID - Instituto de Engenharia de Sistemas e Computadores, Lisboa, Portugal
Gilberto Pedrosa, INESC-ID - Instituto de Engenharia de Sistemas e Computadores, Lisboa, Portugal
Joao Penas, INESC-ID - Instituto de Engenharia de Sistemas e Computadores, Lisboa, Portugal
This paper describes how digitised works are processed at the BND (National Digital Library) in Portugal. The initiative produced half a million digitised images from 25,000 titles of physical items. These form a very heterogeneous sample of historical or otherwise more relevant items (printed monographs and newspapers, maps, manuscripts, drawings, etc.). Digitisation yields TIFF files, which must be processed automatically to create the technical metadata, apply image processing actions, run OCR, build word indexes, and create derived copies for access in PNG, JPG, GIF, and PDF, as well as the master copies of each work for preservation. The paper describes that process. It is fully automated and driven by several XML schemas that cover control of the processes, description of the results (including the OCR outputs), descriptive metadata (in Dublin Core, MARC XML, etc.), and rights and structural metadata (in METS).
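As a rough illustration of the kind of XML-driven control the abstract mentions, the following minimal sketch plans the derived access copies for one TIFF master and serialises them as a small control record. It is not the BND implementation: the function names, the file-naming rule, and the element names (`work`, `master`, `derivative`) are assumptions made for this example, and no real schema (METS, Dublin Core, MARC XML) is reproduced here.

```python
from pathlib import PurePosixPath
import xml.etree.ElementTree as ET

# Derived access formats named in the abstract.
ACCESS_FORMATS = ("png", "jpg", "gif", "pdf")


def plan_derivatives(master_tiff: str) -> dict[str, str]:
    """Map each access format to a derived file path next to the master.

    The naming rule (same stem, new extension) is an assumption.
    """
    master = PurePosixPath(master_tiff)
    if master.suffix.lower() not in (".tif", ".tiff"):
        raise ValueError(f"not a TIFF master: {master_tiff}")
    return {fmt: str(master.with_suffix("." + fmt)) for fmt in ACCESS_FORMATS}


def control_record(master_tiff: str) -> str:
    """Serialise a hypothetical XML control record for one master image."""
    root = ET.Element("work")  # element names are illustrative only
    ET.SubElement(root, "master", {"href": master_tiff})
    for fmt, path in plan_derivatives(master_tiff).items():
        ET.SubElement(root, "derivative", {"format": fmt, "href": path})
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(control_record("works/0001/page_001.tif"))
```

A real pipeline would add the technical metadata, OCR output, and word index as further steps; the sketch covers only the planning of derived copies.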
Citation:
Jose Borbinha, Joao Gil, Gilberto Pedrosa, Joao Penas, "The Case of the Digitized Works at a National Digital Library," Second International Conference on Document Image Analysis for Libraries (DIAL'06), pp. 116-125, 2006