CSDL Home IEEE Transactions on Pattern Analysis & Machine Intelligence 1984 vol.6 Issue No.01 - January
Issue No.01 - January (1984 vol.6)
Jurgen Altmann , Arbeitsgruppe Angewandte Physik und Experimentelle Biophysik, Philipps-Universität Marburg, D-3550 Marburg, West Germany.
A size- and position-invariant description of an image function can be obtained via the absolute value of the Mellin transform of its Fourier amplitude spectrum. If the transform is implemented on a digital computer via a discrete Fourier-Mellin transform, these exact invariances are not preserved due to sampling-and border-effects. In this paper these effects are discussed, and an alternative correlation method is proposed. The method consists of calculating the normalized absolute magnitude of the discrete Fourier transform (DFT) of the image function (which gives invariance to translation and multiplicative amplitude changes) and a subsequent logarithmic distortion in x- and y- direction, which converts scaling to translation. Two such transforms are compared by calculating the normalized Euclidean distances between both for all possible relative shifts along the main diagonal. If, for some shift, the distance has a minimum below a similarity threshold, the underlying image functions will probably differ only by translation and scaling. The magnitude of this shift is related to the scale factor between the objects. Good separation between similar and nonsimilar objects is possible if two size criteria imposed by the DFT are met: the total object size must not exceed N/4 (N is the number of image points in each dimension), and object details have to be larger than about 4 image points. As a consequence, N increases with object complexity and desired scale range.
Jurgen Altmann, "A Fast Correlation Method for Scale-and Translation-Invariant Pattern Recognition", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.6, no. 1, pp. 46-57, January 1984, doi:10.1109/TPAMI.1984.4767474