CSDL Home IEEE Transactions on Pattern Analysis & Machine Intelligence 1983 vol.5 Issue No.06 - June
Issue No.06 - June (1983 vol.5)
Dana H. Ballard , Department of Computer Science, University of Rochester, Rochester, NY 14627.
An important problem in vision is to detect the presence of a known rigid 3-D object. The general 3-D object recognition task can be thought of as building a description of the object that must have at least two parts: 1) the internal description of the object itself (with respect to an object-centered frame); and 2) the transformation of the object-centered frame to the viewer-centered (image) frame. The reason for this decomposition is parsimony: different views of the object should have minimal impact on its description. This is achieved by factoring the object's description into two sets of parameters, one which is view-independent (the object-centered component) and one which is view-varying (the viewing transformation). Often a description of the object is known beforehand and the task reduces to finding the objectframe to viewer-frame transformation. This paper describes a method for handling this case: a known object is detected by finding changes in orientation, translation, and scale of the object from its canonical description. The method is a Hough technique and has the characteristic insensitivity to occlusion and noise.
Dana H. Ballard, "Viewer Independent Shape Recognition", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.5, no. 6, pp. 653-660, June 1983, doi:10.1109/TPAMI.1983.4767456