Issue No. 12, Dec. 2012 (vol. 34)
pp. 2365-2378
Ning Chen , Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Jun Zhu , Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Fuchun Sun , Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
E. P. Xing , Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
ABSTRACT
Learning salient representations of multiview data is an essential step in many applications such as image classification, retrieval, and annotation. Standard predictive methods, such as support vector machines, often use all available features directly, without taking into account the presence of distinct views or the resulting view dependencies, coherence, and complementarity that offer key insights into the semantics of the data. As a result, they deliver weak performance and cannot support view-level analysis. This paper presents a statistical method to learn a predictive subspace representation underlying multiple views, leveraging both multiview dependencies and the availability of supervising side information. Our approach is based on a multiview latent subspace Markov network (MN) that fulfills a weak conditional independence assumption: multiview observations and response variables are conditionally independent given a set of latent variables. To learn the latent subspace MN, we develop a large-margin approach that jointly maximizes data likelihood and minimizes a prediction loss on training data. Learning and inference are done efficiently with a contrastive divergence method. Finally, we extensively evaluate the large-margin latent MN on real image and hotel review datasets for classification, regression, image annotation, and retrieval. Our results demonstrate that the large-margin approach can achieve significant improvements in both prediction performance and the discovery of predictive latent subspace representations.
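The conditional independence assumption and the joint learning objective described above can be sketched in symbols. This is an illustrative formulation only, not the paper's exact notation: the two-view setting, the symbols x, z, y, h, Θ, the trade-off constant C, and the generic margin-based loss ℓ (for example, a hinge loss) are all assumptions introduced here.

```latex
% Assumed notation (not taken from the abstract):
%   x, z   : observations from two views
%   y      : response variable (label or real-valued target)
%   h      : latent subspace variables
%   \Theta : model parameters;  C > 0 : trade-off constant
%   \ell   : a margin-based prediction loss, e.g. hinge loss
\begin{align}
  % Observations and response are conditionally independent given h
  p(\mathbf{x}, \mathbf{z}, y \mid \mathbf{h})
    &= p(\mathbf{x} \mid \mathbf{h})\, p(\mathbf{z} \mid \mathbf{h})\, p(y \mid \mathbf{h}) \\
  % Jointly maximize data likelihood and minimize prediction loss
  \min_{\Theta}\quad
    & -\sum_{d=1}^{D} \log p(\mathbf{x}_d, \mathbf{z}_d; \Theta)
      \;+\; C \sum_{d=1}^{D} \ell\bigl(y_d,\, \hat{y}(\mathbf{x}_d, \mathbf{z}_d; \Theta)\bigr)
\end{align}
```

In this sketch, the first term rewards parameters that explain the multiview observations, while the second penalizes prediction errors on the D training examples. C controls the balance between the two.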
INDEX TERMS
support vector machines, data analysis, image classification, image representation, image retrieval, learning (artificial intelligence), Markov processes, regression analysis, regression, large-margin predictive latent subspace learning, multiview data analysis, salient multiview data representations, image annotation, view-level analysis, statistical method, supervising side-information, multiview latent subspace Markov network, latent subspace MN, data likelihood maximization, contrastive divergence method, hotel review datasets, predictive latent subspace representations, Learning systems, Classification, image retrieval and annotation, Latent subspace model, large-margin learning
CITATION
Ning Chen, Jun Zhu, Fuchun Sun, E. P. Xing, "Large-Margin Predictive Latent Subspace Learning for Multiview Data Analysis," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 34, no. 12, pp. 2365-2378, Dec. 2012, doi:10.1109/TPAMI.2012.64