Learning AND-OR Templates for Object Recognition and Detection
Found in: IEEE Transactions on Pattern Analysis and Machine Intelligence
By Zhangzhang Si, Song-Chun Zhu
Issue Date: September 2013
pp. 2189-2205
This paper presents a framework for unsupervised learning of a hierarchical reconfigurable image template - the AND-OR Template (AOT) for visual objects. The AOT includes: 1) hierarchical composition as
Learning Hybrid Image Templates (HIT) by Information Projection
Found in: IEEE Transactions on Pattern Analysis and Machine Intelligence
By Zhangzhang Si, Song-Chun Zhu
Issue Date: July 2012
pp. 1354-1367
This paper presents a novel framework for learning a generative image representation, the hybrid image template (HIT), from a small number (i.e., 3 to 20) of image examples. Each learned template is composed of, typically, 50 to 500 image patches whose g...
Unsupervised learning of event AND-OR grammar and semantics from video
Found in: Computer Vision, IEEE International Conference on
By Zhangzhang Si, Mingtao Pei, Benjamin Yao, Song-Chun Zhu
Issue Date: November 2011
pp. 41-48
We study the problem of automatically learning event AND-OR grammar from videos of a certain environment, e.g. an office where students conduct daily activities. We propose to learn the event grammar under the information projection and minimum description...
Learning mixed templates for object recognition
Found in: Computer Vision and Pattern Recognition, IEEE Computer Society Conference on
By Zhangzhang Si, Haifeng Gong, Ying Nian Wu, Song-Chun Zhu
Issue Date: June 2009
pp. 272-279
This article proposes a method for learning object templates composed of local sketches and local textures, and investigates the relative importance of the sketches and textures for different object categories. Local sketches and local textures in the obje...
Deformable Template As Active Basis
Found in: Computer Vision, IEEE International Conference on
By Ying Nian Wu, Zhangzhang Si, Chuck Fleming, Song-Chun Zhu
Issue Date: October 2007
pp. 1-8
This article proposes an active basis model and a shared pursuit algorithm for learning deformable templates from image patches of various object categories. In our generative model, a deformable template is in the form of an active basis, which consists o...
Wavelet, active basis, and shape script: a tour in the sparse land
Found in: Proceedings of the international conference on Multimedia information retrieval (MIR '10)
By Ying Nian Wu, Zhangzhang Si
Issue Date: March 2010
pp. 201-210
Sparse coding is a key principle that underlies wavelet representation of natural images. In this paper, we explain that the effort of seeking a common wavelet sparse coding of images from the same object category leads to an active basis model, where the ...