The Community for Technology Leaders
RSS Icon
Issue No.10 - Oct. (2012 vol.34)
pp: 1902-1914
Markus Ulrich , MVTec Software GmbH, Muenchen
Christian Wiedemann , MVTec Software GmbH, Muenchen
Carsten Steger , MVTec Software GmbH, Muenchen
This paper describes an approach for recognizing instances of a 3D object in a single camera image and for determining their 3D poses. A hierarchical model is generated solely based on the geometry information of a 3D CAD model of the object. The approach does not rely on texture or reflectance information of the object's surface, making it useful for a wide range of industrial and robotic applications, e.g., bin-picking. A hierarchical view-based approach that addresses typical problems of previous methods is applied: It handles true perspective, is robust to noise, occlusions, and clutter to an extent that is sufficient for many practical applications, and is invariant to contrast changes. For the generation of this hierarchical model, a new model image generation technique by which scale-space effects can be taken into account is presented. The necessary object views are derived using a similarity-based aspect graph. The high robustness of an exhaustive search is combined with an efficient hierarchical search. The 3D pose is refined by using a least-squares adjustment that minimizes geometric distances in the image, yielding a position accuracy of up to 0.12 percent with respect to the object distance, and an orientation accuracy of up to 0.35 degree in our tests. The recognition time is largely independent of the complexity of the object, but depends mainly on the range of poses within which the object may appear in front of the camera. For efficiency reasons, the approach allows the restriction of the pose range depending on the application. Typical runtimes are in the range of a few hundred ms.
Solid modeling, Three dimensional displays, Cameras, Computational modeling, Image edge detection, Robustness, Design automation, perspective., 3D object recognition, machine vision, aspect graphs, similarity measures, hierarchical, models, robotics, industrial automation, image generation, projections, shape, feature measurement, least squares methods
Markus Ulrich, Christian Wiedemann, Carsten Steger, "Combining Scale-Space and Similarity-Based Aspect Graphs for Fast 3D Object Recognition", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.34, no. 10, pp. 1902-1914, Oct. 2012, doi:10.1109/TPAMI.2011.266
[1] W.E.L. Grimson and D.P. Huttenlocher, "On the Verification of Hypothesized Matches in Model-Based Recognition," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 12, pp. 1201-1213, Dec. 1991.
[2] S. Lanser, O. Munkelt, and C. Zierl, "Robust Video-Based Object Recognition Using CAD Models," Proc. Intelligent Autonomous Systems, pp. 529-536, 1995.
[3] J.H.M. Byne and J.A.D.W. Anderson, "A CAD Based Computer Vision System," Image and Vision Computing, vol. 16, no. 8, pp. 533-539, June 1998.
[4] H. Borotschnig, L. Paletta, M. Prantl, and A. Prinz, "Appearance Based Active Object Recognition," Image and Vision Computing, vol. 18, no. 9, pp. 715-727, June 2000.
[5] J.J. Koenderink and A.J. van Doorn, "The Singularities of the Visual Mapping," Biological Cybernetics, vol. 24, pp. 51-59, 1976.
[6] J.J. Koenderink and A.J. van Doorn, "The Internal Representation of Solid Shape with Respect to Vision," Biological Cybernetics, vol. 32, pp. 211-216, 1979.
[7] S.D. Roy, S. Chaudhury, and S. Banerjee, "Isolated 3-D Object Recognition through Next View Planning," IEEE Trans. Systems, Man, and Cybernetics, vol. 30, no. 1, pp. 67-76, Jan. 2000.
[8] R.D. Schiffenbauer, "A Survey of Aspect Graphs," Technical Report TR-CIS-2001-01, Dept. of Computer and Information Science, Polytechnic Univ. Brooklyn, New York, Feb. 2001.
[9] S.J.D.A.P. Pentland and A. Rosenfeld, "3-D Shape Recovery Using Distributed Aspect Matching," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 14, no. 2, pp. 174-198, Feb. 1992.
[10] C.M. Cyr and B.B. Kimia, "A Similarity-Based Aspect-Graph Approach to 3D Object Recognition," Int'l J. Computer Vision, vol. 57, no. 1, pp. 5-22, 2004.
[11] D.W. Eggert, K.W. Bowyer, C.R. Dyer, H.I. Christensen, and D.B. Goldgof, "The Scale Space Aspect Graph," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 15, no. 11, pp. 1114-1130, Nov. 1993.
[12] S.K. Nayar, S. Baker, and H. Murase, "Parametric Feature Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 471-477, 1996.
[13] C. Von Bank, D.M. Gavrila, and C. Wöhler, "A Visual Quality Inspection System Based on a Hierarchical 3D Pose Estimation Algorithm," Proc. DAGM Symp. Pattern Recognition, pp. 179-186, 2003.
[14] D.G. Lowe, "Three-Dimensional Object Recognition from Single Two-Dimensional Images," Artificial Intelligence, vol. 21, no. 3, pp. 335-395, 1987.
[15] D.G. Lowe, "Fitting Parametrized 3-D Models to Images," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 5, pp. 441-450, May. 1991.
[16] M. Zerroug and R. Nevatia, "Using Invariance and Quasi-invariance for the Segmentation and Recovery of Curved Objects," Proc. Second Joint European US Worksop Applications of Invariance in Computer Vision, J.L. Mundy, A. Zisserman, and D.A. Forsyth, eds., vol. 825, pp. 317-340, 1994.
[17] F. Jurie, "Tracking Objects with a Recognition Algorithm," Pattern Recognition Letters, vol. 19, pp. 331-340, Mar. 1998.
[18] M.S. Costa and L.G. Shapiro, "3D Object Recognition and Pose with Relational Indexing," Computer Vision and Image Understanding, vol. 79, no. 3, pp. 364-407, Sept. 2000.
[19] P. David and D. DeMenthon, "Object Recognition in High Clutter Images Using Line Features," Proc. 10th Int'l Conf. Computer Vision, pp. 1581-1588, 2005.
[20] A. Zisserman, D. Forsyth, J. Mundy, C. Rothwell, J. Liu, and N. Pillow, "3D Object Recognition Using Invariance," Artificial Intelligence, vol. 78, pp. 239-288, 1995.
[21] B. Vijayakumar, D. Kriegman, and J. Ponce, "Invariant-Based Recognition of Complex Curved 3D Objects from Image Contours," Computer Vision and Image Understanding, vol. 72, no. 3, pp. 287-303, Dec. 1998.
[22] I. Weiss and M. Ray, "Model-Based Recognition of 3D Objects from Single Images," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 2, pp. 116-128, Feb. 2001.
[23] R. Strzodka, I. Ihrke, and M. Magnor, "A Graphics Hardware Implementation of the Generalized Hough Transform for Fast Object Recognition, Scale, and 3D Pose Detection," Proc. Int'l Conf. Image Analysis and Processing, pp. 188-193, 2003.
[24] S. Belongie, J. Malik, and J. Puzicha, "Shape Matching and Object Recognition Using Shape Contexts," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 4, pp. 509-522, Apr. 2002.
[25] D.G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[26] A.C. Berg, T.L. Berg, and J. Malik, "Shape Matching and Object Recognition Using Low Distortion Correspondences," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 26-33, 2005.
[27] J. Pilet, V. Lepetit, and P. Fua, "Real-Time Non-Rigid Surface Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 822-828, 2005.
[28] H. Bay, T. Tuytelaars, and L.V. Gool, "SURF: Speeded Up Robust Features," Proc. Ninth European Conf. Computer Vision, pp. 404-417, 2006.
[29] V. Lepetit, "Keypoint Recognition Using Randomized Trees," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 28, no. 9, pp. 1465-1479, Sept. 2006.
[30] S. Hinterstoisser, S. Benhimane, and N. Navab, "N3M: Natural 3D Markers for Real-Time Object Detection and Pose Estimation," Proc. 11th IEEE Int'l Conf. Computer Vision, 2007.
[31] H. Kollnig and H.-H. Nagel, "3D Pose Estimation by Directly Matching Polyhedral Models to Gray Value Gradients," Int'l J. Computer Vision, vol. 23, no. 3, pp. 283-302, 1997.
[32] T. Drummond and R. Cipolla, "Real-Time Visual Tracking of Complex Structures," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 932-946, July 2002.
[33] V. Lepetit and P. Fua, "Monocular Model-Based 3D Tracking of Rigid Objects: A Survey," Foundations and Trends in Computer Graphics and Vision, vol. 1, no. 1, pp. 1-89, 2005.
[34] C. Steger, "Occlusion, Clutter, and Illumination Invariant Object Recognition," Int'l Archives of Photogrammetry and Remote Sensing, vol. 34, part 3A, pp. 345-350, 2002.
[35] C. Wiedemann, M. Ulrich, and C. Steger, "Recognition and Tracking of 3D Objects," Proc. 30th DAGM Symp. Pattern Recognition, G. Rigoll, ed., pp. 132-141, 2008.
[36] M. Ulrich, C. Wiedemann, and C. Steger, "CAD-Based Recognition of 3D Objects in Monocular images," Proc. Int'l Conf. Robotics and Automation, pp. 1191-1198, 2009.
[37] C. Steger, M. Ulrich, and C. Wiedemann, Machine Vision Algorithms and Applications. Wiley-VCH, 2007.
[38] M. Ulrich, "Hierarchical Real-Time Recognition of Compound Objects in Images," PhD thesis, Fakultät für Bauingenieur- und Vermessungswesen, Technische Universität München, Deutsche Geodätische Kommision, Reihe C, Heft Nr. 568, München, 2003.
[39] D.M. Gavrila and V. Philomin, "Real-Time Object Detection for 'Smart' Vehicles," Proc. Seventh IEEE Int'l Conf. Computer Vision, vol. 1, pp. 87-93, 1999.
[40] S. Di Zenzo, "A Note on the Gradient of a Multi-Image," Computer Vision, Graphics, and Image Processing, vol. 33, pp. 116-125, 1986.
[41] C. Steger, "Subpixel-Precise Extraction of Lines and Edges," Int'l Archives of Photogrammetry and Remote Sensing, vol. 33, part B3, pp. 141-156, 2000.
[42] D.H. Ballard, "Generalizing the Hough Transform to Detect Arbitrary Shapes," Pattern Recognition, vol. 13, no. 2, pp. 111-122, 1981.
[43] D.P. Huttenlocher, G.A. Klanderman, and W.J. Rucklidge, "Comparing Images Using the Hausdorff Distance," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 15, no. 9, pp. 850-863, Sept. 1993.
[44] M. Ulrich, C. Steger, and A. Baumgartner, "Real-Time Object Recognition Using a Modified Generalized Hough Transform," Pattern Recognition, vol. 36, no. 11, pp. 2557-2570, Nov. 2003.
[45] J.R. Kender and D.G. Freudenstein, "What Is a 'Degenerate' View?" Proc. 10th Int'l Joint Conf. Artificial Intelligence, pp. 801-804, 1998.
[46] S.J. Dickinson, D. Wilkes, and J.K. Tsotsos, "A Computational Model of View Degeneracy," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 21, no. 8, pp. 673-689, Aug. 1999.
32 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool