VideoPlus: A Method for Capturing the Structure and Appearance of Immersive Environments
April-June 2002 (vol. 8 no. 2)
pp. 171-182

This paper presents a simple approach to capturing the appearance and structure of immersive scenes from imagery acquired with an omnidirectional video camera. The scheme combines techniques from structure from motion with ideas from image-based rendering. An interactive photogrammetric modeling scheme recovers the locations of a set of salient scene features (points and lines) from image measurements in a small set of keyframe images. These estimates then serve as the basis for estimating the position and orientation of the camera at every frame in the video clip. Augmenting the video sequence with this pose information lets the end user index the sequence spatially rather than temporally, and so explore the immersive scene by interactively selecting the desired viewpoint and viewing direction.
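
The idea of indexing video by space rather than by time can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes each frame already carries an estimated camera position and a single yaw angle (a simplification of full 3D orientation), and the names `FramePose`, `nearest_frame`, and `relative_yaw` are hypothetical. Given a desired viewpoint, the sketch picks the frame whose camera was closest to it, then computes how far to pan within that omnidirectional frame to face the desired direction.

```python
import math
from dataclasses import dataclass


@dataclass
class FramePose:
    """Hypothetical per-frame record: video frame plus estimated pose."""
    index: int        # temporal index of the frame in the clip
    position: tuple   # estimated camera position (x, y, z)
    heading: float    # estimated camera yaw in radians (simplified orientation)


def nearest_frame(poses, viewpoint):
    """Return the pose whose camera position is closest to the viewpoint."""
    return min(poses, key=lambda p: math.dist(p.position, viewpoint))


def relative_yaw(pose, desired_heading):
    """Pan angle needed within the frame, wrapped to [-pi, pi)."""
    delta = desired_heading - pose.heading
    return (delta + math.pi) % (2 * math.pi) - math.pi


# Illustrative poses only; real values would come from pose estimation.
poses = [
    FramePose(0, (0.0, 0.0, 0.0), 0.0),
    FramePose(1, (1.0, 0.0, 0.0), 0.1),
    FramePose(2, (2.0, 0.5, 0.0), 0.2),
]

# Spatial query: "show me the scene from here, looking this way".
chosen = nearest_frame(poses, (1.8, 0.4, 0.0))
pan = relative_yaw(chosen, math.pi / 2)
```

A linear scan is enough for a short clip; for long sequences a spatial index such as a k-d tree would be a natural replacement for `nearest_frame`.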

Index Terms:
reconstruction, immersive environments, omnidirectional video, pose estimation
Citation:
C.J. Taylor, "VideoPlus: A Method for Capturing the Structure and Appearance of Immersive Environments," IEEE Transactions on Visualization and Computer Graphics, vol. 8, no. 2, pp. 171-182, April-June 2002, doi:10.1109/2945.998669