Automatic Writer Identification Using Connected-Component Contours and Edge-Based Features of Uppercase Western Script
June 2004 (vol. 26 no. 6)
pp. 787-798

Abstract: In this paper, a new technique for offline writer identification is presented, using connected-component contours (COCOCOs or CO^3s) in uppercase handwritten samples. In our model, the writer is considered to be characterized by a stochastic pattern generator, producing a family of connected components for the uppercase character set. Using a codebook of CO^3s from an independent training set of 100 writers, the probability-density function (PDF) of CO^3s was computed for an independent test set containing 150 unseen writers. Results revealed a high sensitivity of the CO^3 PDF for identifying individual writers on the basis of a single sentence of uppercase characters. The proposed automatic approach bridges the gap between image-statistics approaches at one end and manually measured allograph features of individual characters at the other. Combining the CO^3 PDF with an independent edge-based orientation and curvature PDF yielded very high correct identification rates.

Index Terms:
Writer identification, connected-component contours, edge-orientation features, stochastic allograph emission model.
Lambert Schomaker, Marius Bulacu, "Automatic Writer Identification Using Connected-Component Contours and Edge-Based Features of Uppercase Western Script," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 6, pp. 787-798, June 2004, doi:10.1109/TPAMI.2004.18
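
The codebook-and-PDF idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions that the abstract does not state: each contour is taken to be already resampled to a fixed-length numeric vector, the codebook is a plain list of prototype vectors, contours are assigned to the nearest prototype by Euclidean distance, and writers are compared with a chi-square distance and a nearest-neighbour rule. The function names (`codebook_pdf`, `chi_square`, `identify`) are hypothetical.

```python
def codebook_pdf(contours, codebook):
    """Estimate a writer's PDF over codebook entries.

    Assumption: contours and codebook entries are equal-length
    sequences of numbers (e.g. resampled contour coordinates).
    """
    counts = [0] * len(codebook)
    for contour in contours:
        # Index of the nearest prototype (squared Euclidean distance).
        nearest = min(
            range(len(codebook)),
            key=lambda k: sum((a - b) ** 2 for a, b in zip(contour, codebook[k])),
        )
        counts[nearest] += 1
    total = sum(counts)
    if total == 0:
        return [0.0] * len(codebook)
    # Normalise the histogram so that it sums to 1.
    return [n / total for n in counts]


def chi_square(p, q):
    """Chi-square distance between two discrete PDFs (assumed metric)."""
    return sum((a - b) ** 2 / (a + b) for a, b in zip(p, q) if a + b > 0)


def identify(query_pdf, known_pdfs):
    """Return the known writer whose PDF is closest to the query (assumed rule)."""
    return min(known_pdfs, key=lambda writer: chi_square(query_pdf, known_pdfs[writer]))
```

In use, one PDF would be computed per known writer from reference samples, and a query sample would be attributed to the writer with the smallest distance. A second feature PDF, such as the edge-based one mentioned in the abstract, could be combined by summing or averaging distances, although the abstract does not say how the combination was done.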
