Issue No. 10 - October (2005 vol. 27)
Ko Nishino , IEEE
Shree K. Nayar , IEEE
Tony Jebara , IEEE
Principal Component Analysis (PCA) is extensively used in computer vision and image processing. Since it provides the optimal linear subspace in a least-square sense, it has been used for dimensionality reduction and subspace analysis in various domains. However, its scalability is very limited because of its inherent computational complexity. We introduce a new framework for applying PCA to visual data which takes advantage of the spatio-temporal correlation and localized frequency variations that are typically found in such data. Instead of applying PCA to the whole volume of data (complete set of images), we partition the volume into a set of blocks and apply PCA to each block. Then, we group the subspaces corresponding to the blocks and merge them together. As a result, we not only achieve greater efficiency in the resulting representation of the visual data, but also successfully scale PCA to handle large data sets. We present a thorough analysis of the computational complexity and storage benefits of our approach. We apply our algorithm to several types of videos. We show that, in addition to its storage and speed benefits, the algorithm results in a useful representation of the visual data.
Index Terms- Principal component analysis, singular value decomposition, eigenvalues and eigenvectors, natural image statistics, clustering, region growing/partitioning.
K. Nishino, T. Jebara and S. K. Nayar, "Clustered Blockwise PCA for Representing Visual Data," in IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 27, no. , pp. 1675-1679, 2005.