The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.11 - Nov. (2013 vol.35)
pp: 2651-2664
Zhuolin Jiang , Inst. for Adv. Comput. Studies, Univ. of Maryland, College Park, MD, USA
Zhe Lin , Adv. Technol. Labs., Adobe, San Jose, CA, USA
L. S. Davis , Inst. for Adv. Comput. Studies, Univ. of Maryland, College Park, MD, USA
ABSTRACT
A label consistent K-SVD (LC-KSVD) algorithm to learn a discriminative dictionary for sparse coding is presented. In addition to using class labels of training data, we also associate label information with each dictionary item (columns of the dictionary matrix) to enforce discriminability in sparse codes during the dictionary learning process. More specifically, we introduce a new label consistency constraint called "discriminative sparse-code error" and combine it with the reconstruction error and the classification error to form a unified objective function. The optimal solution is efficiently obtained using the K-SVD algorithm. Our algorithm learns a single overcomplete dictionary and an optimal linear classifier jointly. The incremental dictionary learning algorithm is presented for the situation of limited memory resources. It yields dictionaries so that feature points with the same class labels have similar sparse codes. Experimental results demonstrate that our algorithm outperforms many recently proposed sparse-coding techniques for face, action, scene, and object category recognition under the same learning conditions.
INDEX TERMS
Dictionaries, Linear programming, Classification algorithms, Training, Algorithm design and analysis, Image reconstruction, Testing,discriminative sparse-code error, Discriminative dictionary learning, incremental dictionary learning, supervised learning, label consistent K-SVD
CITATION
Zhuolin Jiang, Zhe Lin, L. S. Davis, "Label Consistent K-SVD: Learning a Discriminative Dictionary for Recognition", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.35, no. 11, pp. 2651-2664, Nov. 2013, doi:10.1109/TPAMI.2013.88
REFERENCES
[1] M. Elad and M. Aharon, "Image Denosing via Sparse and Redundant Representations over Learned Dictionaries," IEEE Trans. Image Processing, vol. 54, no. 12, pp. 3736-3745, Dec. 2006.
[2] M. Ranzato, C. Poultney, S. Chopra, and Y. LeCun, "Efficient Learning of Sparse Representations with an Energy-Based Model," Proc. Conf. Neural Information Processing Systems, 2006.
[3] J. Mairal, F. Bach, J. Ponce, and G. Sapiro, "Online Learning for Matrix Factorization and Sparse Coding," J. Machine Learning Research, vol. 11, pp. 19-60, 2010.
[4] J. Yang, J. Wright, T. Huang, and Y. Ma, "Image Superresolution as Sparse Representation of Raw Patches," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[5] J. Yang, K. Yu, Y. Gong, and T. Huang, "Linear Spatial Pyramid Matching Using Sparse Coding for Image Classification," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
[6] J. Wright, M. Yang, A. Ganesh, S. Sastry, and Y. Ma, "Robust Face Recognition via Sparse Representation," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31, no. 2, pp. 210-227, Feb. 2009.
[7] D. Bradley and J. Bagnell, "Differential Sparse Coding," Proc. Conf. Neural Information Processing Systems, 2008.
[8] J. Yang and M. Yang, "Top-Down Visual Saliency via Joint CRF and Dictionary Learning," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2012.
[9] J. Mairal, F. Bach, and J. Ponce, "Task-Driven Dictionary Learning," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 34, no. 4, pp. 791-804, Apr. 2012.
[10] H. Lee, A. Battle, R. Raina, and A.Y. Ng, "Efficient Sparse Coding Algorithms," Proc. Conf. Neural Information Processing Systems, 2006.
[11] M. Aharon, M. Elad, and A. Bruckstein, "K-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation," IEEE Trans. Signal Processing, vol. 54, no. 1, pp. 4311-4322, Nov. 2006.
[12] J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong, "Locality-Constrained Linear Coding for Image Classification," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[13] G. Karol and Y. LeCun, "Learning Fast Approximations of Sparse Coding," Proc. Int'l Conf. Machine Learning, 2010.
[14] J. Mairal, F. Bach, J. Ponce, G. Sapiro, and A. Zisserman, "Discriminative Learned Dictionaries for Local Image Analysis," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[15] D. Pham and S. Venkatesh, "Joint Learning and Dictionary Construction for Pattern Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[16] Q. Zhang and B. Li, "Discriminative K-SVD for Dictionary Learning in Face Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[17] K. Engan, S. Aase, and J. Husφy, "Frame Based Signal Compression Using Method of Optimal Directions (mod)," Proc. IEEE Int'l Symp. Circuits and Systems, 1999.
[18] R. Jenatton, J. Mairal, G. Obozinski, and F. Bach, "Proximal Methods for Sparse Hierarchical Dictionary Learning," Proc. Int'l Conf. Machine Learning, 2010.
[19] R. Raina, A. Battle, H. Lee, B. Packer, and A.Y. Ng, "Self-Taught Learning: Transfer Learning from Unlabeled Data," Proc. Int'l Conf. Machine Learning, 2007.
[20] K. Kavukcuoglu, M. Ranzato, R. Fergus, and Y. LeCun, "Learning Invariant Features through Topographic Filter Maps," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
[21] M. Yang, L. Zhang, X. Feng, and D. Zhang, "Fisher Discrimination Dictionary Learning for Sparse Representation," Proc. IEEE Int'l Conf. Computer Vision, 2011.
[22] W. Zhang, A. Surve, X. Fern, and T. Dietterich, "Learning Non-Redundant Codebooks for Classifying Complex Objects," Proc. Int'l Conf. Machine Learning, 2009.
[23] L. Yang, R. Jin, R. Sukthankar, and F. Jurie, "Unifying Discriminative Visual Codebook Genearation with Classifier Training for Object Category Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[24] J. Mairal, M. Leordeanu, F. Bach, M. Hebert, and J. Ponce, "Discriminative Sparse Image Models for Class-Specific Edge Detection and Image Interpretation," Proc. European Conf. Computer Vision, 2008.
[25] I. Ramirez, P. Sprechmann, and G. Sapiro, "Classification and Clustering via Dictionary Learning with Structured Incoherence and Shared Features," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[26] N. Zhou, Y. Shen, J. Peng, and J. Fan, "Learning Inter-Related Visual Dictionary for Object Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2012.
[27] F. Perronnin, "Universal and Adapted Vocabularies for Generic Visual Categorization," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 30, no. 7, pp. 1243-1256, July 2008.
[28] R. Sivalingam, D. Boley, V. Morellas, and N. Papanikolopoulos, "Positive Definite Dictionary Learning for Region Covariances," Proc. IEEE Int'l Conf. Computer Vision, 2011.
[29] J. Winn, A. Criminisi, and T. Minka, "Object Categorization by Learned Universal Visual Dictionary," Proc. IEEE Int'l Conf. Computer Vision, 2005.
[30] J. Liu and M. Shah, "Learning Human Actions via Information Maximization," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[31] S. Lazebnik and M. Raginsky, "Supervised Learning of Quantizer Codebooks by Information Loss Minimization," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31, no. 7, pp. 1294-1309, July 2009.
[32] Q. Qiu, Z. Jiang, and R. Chellappa, "Sparse Dictionary-Based Representation and Recognition of Action Attributes," Proc. IEEE Int'l Conf. Computer Vision, 2011.
[33] B. Fulkerson, A. Vedaldi, and S. Soatto, "Localizing Objects with Smart Dictionaries," Proc. European Conf. Computer Vision, 2008.
[34] N. Slonim and N. Tishby, "Agglomerative Information Bottleneck," Proc. Conf. Neural Information Processing Systems, 1999.
[35] A. Krause and V. Cevher, "Submodular Dictionary Selection for Sparse Representation," Proc. Int'l Conf. Machine Learning, 2010.
[36] Z. Jiang, G. Zhang, and L. Davis, "Submodular Dictionary Learning for Sparse Coding," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2012.
[37] J. Yang, K. Yu, and T. Huang, "Supervised Translation-Invariant Sparse Coding," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[38] J. Mairal, F. Bach, J. Ponce, G. Sapiro, and A. Zisserman, "Supervised Dictionary Learning," Proc. Conf. Neural Information Processing Systems, 2009.
[39] Y. Boureau, F. Bach, Y. LeCun, and J. Ponce, "Learning Mid-Level Features for Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[40] K. Huang and S. Aviyente, "Sparse Representation for Signal Classification," Proc. Conf. Neural Information Processing Systems, 2007.
[41] X. Lian, Z. Li, C. Wang, B. Lu, and L. Zhang, "Probabilistic Models for Supervised Dictinary Learning," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[42] X. Lian, Z. Li, B. Lu, and L. Zhang, "Max-Margin Dictionary Learning for Multiclass Image Categorization," Proc. European Conf. Computer Vision, 2010.
[43] J. Wright, Y. Ma, J. Mairal, G. Sapiro, T. Huang, and S. Yan, "Sparse Representation for Computer Vision and Pattern Recognition," Proc. IEEE, vol. 98, no. 6, pp. 1031-1044, June 2010.
[44] J. Tropp and A. Gilbert, "Signal Recovery from Random Measurements via Orthogonal Matching Pursuit," IEEE Trans. Information Theory, vol. 53, no. 12, pp. 4655-4666, Dec. 2007.
[45] G. Golub, P. Hansen, and D. O'leary, "Tikhonov Regularization and Total Least Squares," SIM J. Matrix Analysis Applications, vol. 21, no. 1, pp. 185-194, 1999.
[46] E. Candes and T. Tao, "Decoding by Linear Programming," IEEE Trans. Information Theory, vol. 51, no. 12, pp. 4203-4215, Dec. 2005.
[47] N. Murata, "Statistical Study on On-Line Learning," On-Line Learning in Neural Networks, Cambridge Univ. Press, 1999.
[48] S. Lazebnik, C. Schmid, and J. Ponce, "Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2007.
[49] S. Sadanand and J. Corso, "Action Bank: A High-Level Representation of Activity in Video," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2012.
[50] A. Georghiades, P. Belhumeur, and D. Kriegman, "From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 6, pp. 643-660, June 2001.
[51] A. Martinez and R. Benavente, "The AR Face Database," CVC Technical Report 24, 1998.
[52] L. FeiFei, R. Fergus, and P. Perona, "Learning Generative Visual Models from Few Training Samples: An Incremental Bayesian Approach Tested on 101 Object Categories," Proc. IEEE Conf. Computer Vision and Pattern Recognition Workshop Generative Model Based Vision, 2004.
[53] G. Griffin, A. Holub, and P. Perona, "Caltech-256 Object Category Data Set," CIT Technical Report 7694, 2007.
[54] M. Rodriguez, J. Ahmed, and M. Shah, "A Spatio-Temporal Maximum Average Correlation Height Filter for Action Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[55] H. Zhang, A. Berg, M. Maire, and J. Malik, "SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2006.
[56] O. Boiman, E. Shechtman, and M. Irani, "In Defense of Nearest-Neighbor Based Image Classification," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[57] P. Jain, B. Kullis, and K. Grauman, "Fast Image Search for Learned Metrics," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[58] J. Gemert, J. Geusebroek, C. Veenman, and A. Smeulders, "Kernel Codebooks for Scene Categorization," Proc. European Conf. Computer Vision, 2008.
[59] S. Gao, I. Tsang, L. Chia, and P. Zhao, "Laplacian Sparse Coding for Image Classification," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[60] L. Yeffet and L. Wolf, "Local Trinary Patterns for Human Action Recognition," Proc. IEEE Int'l Conf. Computer Vision, 2009.
[61] X. Wu, D. Xu, L. Duan, and J. Luo, "Action Recognition Using Context and Appearance Distribution Features," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2011.
[62] A. Kovashka and K. Grauman, "Learning a Hierarchy of Discriminative Space-Time Neighborhood Features for Human Action Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[63] Q. Le, W. Zou, S. Yeung, and A. Ng, "Learning Hierarchical Invariant Spatiotemporal Features for Action Recognition with Independent Subspace Analysis," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2011.
[64] A. Yao, J. Gall, and L.V. Gool, "A Hough Transform-Based Voting Framework for Action Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
199 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool