This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Enhancing Bag-of-Words Models with Semantics-Preserving Metric Learning
January-March 2011 (vol. 18 no. 1)
pp. 24-37
Lei Wu, Michigan State University
Steven C.H. Hoi, Nanyang Technological University

The authors present an online semantics-preserving, metric-learning technique for improving the bag-of-words model and addressing the semantic-gap issue.

1. L. Wu et al., "Distance Metric Learning from Uncertain Side Information with Application to Automated Photo Tagging," Proc. 17th ACM Int'l Conf. Multimedia, ACM Press, 2009, pp. 135-144.
2. G. Carneiro and N. Vasconcelos, "Formulating Semantic Image Annotation as a Supervised Learning Problem," Proc. IEEE Conf. Computer Vision and Pattern Recognition, IEEE CS Press, 2005, pp. 163-168.
3. L. Wu et al., "Scale-Invariant Visual Language Modeling for Object Categorization," IEEE Trans. Multimedia, vol. 11, no. 2, 2009, pp. 286-294.
4. D.D. Lewis, "Naive (Bayes) At Forty: The Independence Assumption in Information Retrieval," Proc. 10th European Conf. Machine Learning, no. 1398, Assoc. Computational Linguistics, 1998, pp. 4-15.
5. D.G. Lowe, "Distinctive Image Features from Scale-Invariant Key points," Int. J. Computer Vision, vol. 60, 2004, pp. 91-110.
6. J.A. Hartigan, Clustering Algorithms, John Wiley & Sons, 1975.
7. L. Wu, S.C.H. Hoi, and N. Yu, "Semantics-Preserving Bag-of-Words Models and Applications," IEEE Trans. Image Processing, vol. 19, no. 7, 2010, pp. 1908-1920.
8. B.C. Russell et al., "Labelme: A Database and Web-Based Tool for Image Annotation," Int. J. Computer Vision, vol. 77, nos. 1-3, 2008, pp. 157-173.
9. A. Bar-Hillel et al., "Learning Distance Functions Using Equivalence Relations." Proc. Int'l Conf. on Machine Learning, AAAI Press, 2003, pp. 11-18.
10. J.V. Davis et al., "Information-Theoretic Metric Learning," Proc. Int'l Conf. Machine Learning, ACM Press, 2007, pp. 209-216.
11. K. Weinberger, J. Blitzer, and L. Saul, "Distance Metric Learning for Large Margin Nearest Neighbor Classification," Advances in Neural Information Processing Systems, vol. 18, 2006, pp. 1473-1480.
12. J. Goldberger et al., "Neighborhood Component Analysis," Advances in Neural Information Processing Systems, The MIT Press, 2004.
13. J.C. van Gemert et al., "Visual Word Ambiguity," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 7, 2010, pp. 1271-1283.
14. F. Perronnin et al., "Adapted Vocabularies for Generic Visual Categorization," Proc. European Conf. Computer Vision, IOS Press, 2006, pp. 464-475.
15. M. Everingham et al., The PASCAL Visual Object Classes Challenge 2006 (VOC2006) Results, 2006; http://www.pascal-network.org/challenges/ VOC/voc2006results.pdf.
1. J. Yang et al., "Evaluating Bag-of-Visual-Words Representations in Scene Classification," Proc. Int'l Workshop Multimedia Information Retrieval, ACM Press, 2007, pp. 197-206.
2. J.C. van Gemert et al., "Visual Word Ambiguity," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 7, 2010, pp. 1271-1283.
3. L. Wu, S.C.H. Hoi, and N. Yu, "Semantics-Preserving Bag-of-Words Models and Applications," IEEE Trans. Image Processing, vol. 19, no. 7, 2010, pp. 1908-1920.
4. J. Goldberger et al., "Neighborhood Component Analysis," Advances in Neural Information Processing Systems, The MIT Press, 2004.
5. A. Globerson and S. Roweis, "Metric Learning by Collapsing Classes," Advances in Neural Information Processing Systems, The MIT Press, 2005.
6. K. Weinberger, J. Blitzer, and L. Saul, "Distance Metric Learning for Large Margin Nearest Neighbor Classification," Advances in Neural Information Processing Systems, vol. 18, 2006, pp. 1473-1480.
7. J.V. Davis et al., "Information-Theoretic Metric Learning," Proc. Int'l Conf. Machine Learning, ACM Press, 2007, pp. 209-216.
8. A. Bar-Hillel et al., "Learning Distance Functions Using Equivalence Relations," Proc. Int'l Conf. Machine Learning, AAAI Press, 2003, pp. 11-18.
9. S.C.H. Hoi et al., "Learning Distance Metrics with Contextual Constraints for Image Retrieval," Proc. IEEE Conf. Computer Vision and Pattern Recognition, IEEE CS Press, 2006.

Index Terms:
bag-of-words models, semantic gap, distance metric learning, object codebook, image annotation, object recognition, multimedia and graphics, IEEE MultiMedia
Citation:
Lei Wu, Steven C.H. Hoi, "Enhancing Bag-of-Words Models with Semantics-Preserving Metric Learning," IEEE Multimedia, vol. 18, no. 1, pp. 24-37, Jan.-March 2011, doi:10.1109/MMUL.2011.7
Usage of this product signifies your acceptance of the Terms of Use.