This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes
October 2006 (vol. 28 no. 10)
pp. 1584-1601
Viewpoint independent recognition of free-form objects and their segmentation in the presence of clutter and occlusions is a challenging task. We present a novel 3D model-based algorithm which performs this task automatically and efficiently. A 3D model of an object is automatically constructed offline from its multiple unordered range images (views). These views are converted into multidimensional table representations (which we refer to as tensors). Correspondences are automatically established between these views by simultaneously matching the tensors of a view with those of the remaining views using a hash table-based voting scheme. This results in a graph of relative transformations used to register the views before they are integrated into a seamless 3D model. These models and their tensor representations constitute the model library. During online recognition, a tensor from the scene is simultaneously matched with those in the library by casting votes. Similarity measures are calculated for the model tensors which receive the most votes. The model with the highest similarity is transformed to the scene and, if it aligns accurately with an object in the scene, that object is declared as recognized and is segmented. This process is repeated until the scene is completely segmented. Experiments were performed on real and synthetic data comprised of 55 models and 610 scenes and an overall recognition rate of 95 percent was achieved. Comparison with the spin images revealed that our algorithm is superior in terms of recognition rate and efficiency.

[1] A.P. Ashbrook, R.B. Fisher, C. Robertson, and N. Werghi, “Finding Surface Correspondence for Object Recognition and Registration Using Pairwise Geometric Histograms,” Int'l J. Pattern Recognition and Artificial Intelligence, vol. 2, pp. 674-686, 1998.
[2] S. Avidan, Y. Moses, and Y. Moses, “Probabilistic Multi-View Correspondence in a Distributed Setting with No Central Server,” Proc. European Conf. Computer Vision, vol. 4, pp. 428-441, 2004.
[3] P. Besl, Machine Vision for Three-Dimensional Scenes. pp. 25-71, Academic Press, 1990.
[4] P.J. Besl and N.D. McKay, “Reconstruction of Real-World Objects via Simultaneous Registration and Robust Combination of Multiple Range Images,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 14, no. 2, pp. 239-256, Feb. 1992.
[5] R.J. Campbell and P.J. Flynn, “A WWW-Accessible 3D Image and Model Database for Computer Vision Research,” Empirical Evaluation Methods in Computer Vision, pp. 148-154, 1998.
[6] R.J. Campbell and P.J. Flynn, “A Survey of Free-Form Object Representation and Recognition Techniques,” Computer Vision and Understanding, vol. 81, no. 2, pp. 166-210, 2001.
[7] O. Carmichael, D. Huber, and M. Hebert, “Large Data Sets and Confusing Scenes in 3-D Surface Matching and Recognition,” Proc. Int'l Conf. 3-D Digital Imaging and Modeling, pp. 358-367, 1999.
[8] C. Chen, Y. Hung, and J. Cheng, “RANSAC-Based DARCES: A New Approach to Fast Automatic Registration of Partially Overlapping Range Images,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 21, no. 11, pp. 1229-1234, Nov. 1991.
[9] C.S. Chua and R. Jarvis, “3D Free-Form Surface Registration and Object Recognition,” Int'l J. Computer Vision, vol. 17, pp. 77-99, 1996.
[10] C.S. Chua and R. Jarvis, “Point Signatures: A New Representation for 3D Object Recognition,” Int'l J. Computer Vision, vol. 25, no. 1, pp. 63-85, 1997.
[11] B. Curless and M. Levoy, “A Volumetric Method for Building Complex Models from Range Images,” Proc. SIGGRAPH Conf., pp. 303-312, 1996.
[12] R. Donamukkala, D. Huber, A. Kapuria, and M. Hebert, “Automatic Class Selection and Prototyping for 3-D Object Classification,” Proc. Int'l Conf. 3-D Digital Imaging and Modeling, pp. 64-71, 2005.
[13] C. Dorai and A.K. Jain, “COSMOS: A Representation Scheme for 3D Free-Form Objects,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 10, pp. 1115-1130, Oct. 1997.
[14] P.J. Flynn and A.K. Jain, “CAD-Based Computer Vision: From CAD Models to Relational Graphs,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 2, pp. 114-132, Feb. 1991.
[15] J. Foley, A. van Dam, S. Feiner, and J. Hughes, Computer Graphics-Principles and Practice. Addison-Wesley, 1990.
[16] M. Garland and P.S. Heckbert, “Surface Simplification Using Quadric Error Metrics,” Proc. SIGGRAPH Conf., pp. 209-216, 1997.
[17] M. Hebert, K. Ikeuchi, and H. Delingette, “A Spherical Representation for Recognition of Free-Form Surfaces,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 17, no. 7, pp. 681-690, July 1995.
[18] G. Hetzel, B. Leibe, P. Levi, and B. Schiele, “3D Object Recognition from Range Images Using Local Feature Histograms,” Proc. IEEE Int'l Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 394-399, 2001.
[19] K. Higuchi, M. Hebert, and K. Ikeuchi, “Building 3-D Models from Unregistered Range Images,” Proc. IEEE Int'l Conf. Robotics and Automation, vol. 3, pp. 2248-2253, 1994.
[20] D. Huber and M. Hebert, “3D Modeling Using a Statistical Sensor Model and Stochastic Search,” Proc. IEEE Int'l Conf. Computer Vision and Pattern Recognition, pp. 858-865, 2003.
[21] D. Huber, A. Kapuria, R. Donamukkala, and M. Hebert, “Parts-Based 3D Object Recognition,” Proc. IEEE Int'l Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 82-89, 2004.
[22] A.E. Johnson and M. Hebert, “Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 21, no. 5, pp. 674-686, May 1999.
[23] T. Joshi, J. Ponce, B. Vijayakumar, and D. Kriegman, “Hot Curves for Modeling and Recognition of Smooth Curved 3D Objects,” Proc. IEEE Int'l Conf. Computer Vision and Pattern Recognition, pp. 876-880, 1994.
[24] S.B. Kang and K. Ikeuchi, “The Complex EGI: A New Representation for 3D Pose Determination,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 15, pp. 707-721, 1993.
[25] Y. Lamdan and H. Wolfson, “Geometric Hashing: A General and Efficient Model-Based Recognitin Scheme,” Proc. IEEE Int'l Conf. Computer Vision, pp. 238-249, 1988.
[26] W. Lorensen and H. Cline, “A High Resolution 3D Surface Construction Algorithm,” Proc. SIGGRAPH Conf., pp. 163-169, 1987.
[27] G. Mamic and M. Bennamoun, “Representation and Recognition of Free-Form Objects,” Digital Signal Processing, vol. 12, pp. 47-76, 2002.
[28] “Mesh Tool Box,” Vision and Mobile Robotics Laboratory, Carnegie Mellon Univ., http://www-2.cs.cmu.edu/~vmr/software/mesh toolbox downloads.html, 2004.
[29] A.S. Mian, M. Bennamoun, and R.A. Owens, “From Unordered Range Images to 3D Models: A Fully Automatic Multiview Correspondence Algorithm,” Theory and Practice of Computer Graphics, pp. 162-166, 2004.
[30] A.S. Mian, M. Bennamoun, and R.A. Owens, “Automatic Multiview Coarse Registration of Range Images for 3D Modeling,” Proc. IEEE Conf. Cybernetics and Intelligent Systems, vol. 1, pp. 158-163, 2004.
[31] A.S. Mian, M. Bennamoun, and R.A. Owens, “A Novel Algorithm for Automatic 3D Model-Based Free-Form Object Recognition,” Proc. IEEE Int'l Conf. Systems, Man, and Cybernetics, vol. 7, pp. 6348-6353, 2004.
[32] A.S. Mian, M. Bennamoun, and R.A. Owens, “3D Recognition and Segmentation of Objects in Cluttered Scenes,” Proc. IEEE Workshop Applications of Computer Vision, vol. 1, pp. 8-13, 2005.
[33] A.S. Mian, M. Bennamoun, and R.A. Owens, “Automatic Correspondence for 3D Modeling: An Extensive Review,” Int'l J. Shape Modeling, 2005.
[34] A.S. Mian, M. Bennamoun, and R.A. Owens, “A Novel Representation and Feature Matching Algorithm for Automatic Pairwise Registration of Range Images,” Int'l J. Computer Vision, vol. 66, no. 1, pp. 19-40, 2006.
[35] T. Oishi, R. Sagawa, A. Nakazawa, R. Kurazume, and K. Ikeuchi, “Parallel Alignment of a Large Number of Range Images,” Proc. Int'l Conf. 3-D Digital Imaging and Modeling, pp. 195-202, 2003.
[36] S. Rusinkiewicz and M. Levoy, “Efficient Variants of the ICP Algorithm,” Proc. Int'l Conf. 3-D Digital Imaging and Modeling, pp. 145-152, 2001.
[37] Y. Shan, B. Matei, H.S. Sawhney, R. Kumar, D. Huber, and M. Hebert, “Linear Model Hashing and Batch RANSAC for Rapid and Accurate Object Recognition,” Proc. IEEE Int'l Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 121-128, 2004.
[38] “Software Packages,” Stanford Computer Graphics Laboratory, http://graphics.stanford.edusoftware/, 2005.
[39] F. Stein and G. Medioni, “Structural Indexing: Efficient 3-D Object Recognition,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 14, no. 2, pp. 125-145, Feb. 1992.
[40] J. Wand and F.S. Cohen, “Part II: 3-D Object Recognition and Shape Estimation from Image Contours Using B-Splines, Shape Invariant Matching, and Neural Network,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 16, no. 1, pp. 13-23, Jan. 1994.
[41] J. Williams and M. Bennamoun, “Simultaneous Registration of Multiple Corresponding Point Sets,” Computer Vision and Understanding, vol. 81, no. 1, pp. 117-142, 2001.
[42] J.V. Wyngaerd, L.V. Gool, R. Koth, and M. Proesmans, “Invariant-Based Registration of Surface Patches,” Proc. IEEE Int'l Conf. Computer Vision, vol. 1, pp. 301-306, 1999.

Index Terms:
Multiview correspondence, registration, 3D object recognition, segmentation, 3D representation, shape descriptor, geometric hashing.
Citation:
Ajmal S. Mian, Mohammed Bennamoun, Robyn Owens, "Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 10, pp. 1584-1601, Oct. 2006, doi:10.1109/TPAMI.2006.213
Usage of this product signifies your acceptance of the Terms of Use.