This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Scalable Mobile Video Retrieval with Sparse Projection Learning and Pseudo Label Mining
July-Sept. 2013 (vol. 20 no. 3)
pp. 47-57
Guan-Long Wu, National Taiwan University
Yin-Hsi Kuo, National Taiwan University
Tzu-Hsuan Chiu, National Taiwan University
Winston H. Hsu, National Taiwan University
Lexing Xie, Australian National University
Retrieving relevant videos from a large corpus on mobile devices is a vital challenge. This article addresses two key issues for mobile search on user-generated videos. The first is the lack of good relevance measurement for learning semantically rich representations, due to the unconstrained nature of online videos. The second is the limited resources on mobile devices, stringent bandwidth, and delay requirement between the device and video server. The authors propose a knowledge-embedded sparse projection learning approach. To alleviate the need for expensive annotation in hash learning, they investigate varying approaches for pseudo label mining, where explicit semantic analysis leverages Wikipedia. In addition, they propose a novel sparse projection method to address the efficiency challenge by learning a discriminative compact representation that drastically reduces transmission costs. With less than 10 percent nonzero elements in the projection matrix, it also reduces computational and storage costs. The experimental results on 100,000 videos show that the proposed algorithm yields performance competitive with the prior state-of-the-art hashing methods, which are not applicable for mobiles and solely rely on costly manual annotations. The average query time for 100,000 videos was only 0.592 seconds.
Index Terms:
Semantics,Mobile communication,Sparse matrices,Mobile handsets,Encyclopedias,Electronic publishing,mobile video retrieval,Semantics,Mobile communication,Sparse matrices,Mobile handsets,Encyclopedias,Electronic publishing,explicit semantic analysis,multimedia,multimedia applications,content-based video search,hashing,sparsity
Citation:
Guan-Long Wu, Yin-Hsi Kuo, Tzu-Hsuan Chiu, Winston H. Hsu, Lexing Xie, "Scalable Mobile Video Retrieval with Sparse Projection Learning and Pseudo Label Mining," IEEE Multimedia, vol. 20, no. 3, pp. 47-57, July-Sept. 2013, doi:10.1109/MMUL.2013.13
Usage of this product signifies your acceptance of the Terms of Use.