On Active Camera Control and Camera Motion Recovery with Foveate Wavelet Transform
August 2001 (vol. 23 no. 8)
pp. 896-903

Abstract: This paper proposes a new variable resolution technique, the Foveate Wavelet Transform (FWT), for representing digital images efficiently. Compared with existing variable resolution techniques, the strengths of the proposed scheme are its linearity preservation, orientation selectivity, and flexibility, while it supports behaviors resembling those of the animate vision system. The FWT preserves linearity because the transform applies only low-pass and/or high-pass filtering to different regions of an image. Orientation selectivity means that details along the horizontal, vertical, and diagonal directions are readily available in the FWT representation. The flexibility of the representation is shown by how readily it extends to foveae of different numbers, shapes, and locations. To demonstrate the efficacy of the FWT, two applications are presented. First, an FWT-based active camera control scheme is developed, in which the computer moves a camera to track a moving object in the scene. Second, an FWT-based method for recovering pan/tilt/zoom camera movements from video clips is developed. Experiments with both applications show encouraging performance.
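
The linearity and orientation-selectivity claims can be illustrated with a toy example. The sketch below is not the authors' FWT: it assumes a single-level 2D Haar transform and a hypothetical circular fovea, outside of which the detail coefficients are dropped. The names `haar_level` and `foveate`, the `fovea` and `radius` parameters, and the choice of Haar filters are all assumptions made for illustration.

```python
def haar_level(img):
    """One level of a 2D Haar transform on a list-of-lists image with even sides.

    Returns four half-size subbands: the low-pass approximation and the
    horizontal, vertical, and diagonal details.
    """
    rows, cols = len(img), len(img[0])
    ll, lh, hl, hh = [], [], [], []
    for r in range(0, rows, 2):
        bands = ([], [], [], [])
        for c in range(0, cols, 2):
            a, b = img[r][c], img[r][c + 1]
            d, e = img[r + 1][c], img[r + 1][c + 1]
            bands[0].append((a + b + d + e) / 4.0)  # low-pass in both directions
            bands[1].append((a + b - d - e) / 4.0)  # row difference: horizontal detail
            bands[2].append((a - b + d - e) / 4.0)  # column difference: vertical detail
            bands[3].append((a - b - d + e) / 4.0)  # diagonal detail
        for out, row in zip((ll, lh, hl, hh), bands):
            out.append(row)
    return ll, lh, hl, hh


def foveate(img, fovea, radius):
    """Keep detail coefficients only within `radius` of `fovea`.

    `fovea` is a (row, col) position in subband coordinates (an assumption
    of this sketch). The approximation band is kept everywhere, so the
    periphery is represented at low resolution only.
    """
    ll, lh, hl, hh = haar_level(img)
    fr, fc = fovea

    def keep(band):
        return [
            [v if (r - fr) ** 2 + (c - fc) ** 2 <= radius ** 2 else 0.0
             for c, v in enumerate(row)]
            for r, row in enumerate(band)
        ]

    return ll, keep(lh), keep(hl), keep(hh)


# Linearity check: transforming 2*x + 3*y matches 2*T(x) + 3*T(y).
x = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
y = [[0, 1, 0, 1], [2, 0, 2, 0], [0, 3, 0, 3], [4, 0, 4, 0]]
mix = [[2 * p + 3 * q for p, q in zip(rx, ry)] for rx, ry in zip(x, y)]
tx, ty, tmix = (foveate(im, (0, 0), 0) for im in (x, y, mix))
linear = all(
    m == 2 * p + 3 * q
    for bm, bx, by in zip(tmix, tx, ty)
    for rm, rx, ry in zip(bm, bx, by)
    for m, p, q in zip(rm, rx, ry)
)
print(linear)
```

Because every step is either a fixed filter or a fixed region mask, the toy transform is linear in the image, which is the property the abstract attributes to the FWT. The three detail bands correspond to the horizontal, vertical, and diagonal directions it mentions.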

Index Terms:
Active vision, wavelet transform, variable resolution techniques, gaze control, object tracking, motion detection.
Jie Wei, Ze-Nian Li, "On Active Camera Control and Camera Motion Recovery with Foveate Wavelet Transform," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 8, pp. 896-903, Aug. 2001, doi:10.1109/34.946992