Issue No.03 - July-Sept. (2013 vol.20)
pp: 72-86
Yonghong Tian , Peking University
Tiejun Huang , Peking University
Menglin Jiang , Peking University
Wen Gao , Peking University
ABSTRACT
For video copy detection, no single audio-visual feature, or single detector based on several features, can work well for all transformations. This article proposes a novel video copy-detection and localization approach with scalable cascading of complementary detectors and multiscale sequence matching. In this cascade framework, a soft-threshold learning algorithm is utilized to estimate the optimal decision thresholds for detectors, and a multiscale sequence matching method is employed to precisely locate copies using a 2D Hough transform and multigranularities similarity evaluation. Excellent performance on the TRECVID-CBCD 2011 benchmark dataset shows the effectiveness and efficiency of the proposed approach.
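The two ideas named in the abstract, a cascade of detectors with per-detector decision thresholds and sequence alignment by 2D Hough voting, can be illustrated with a minimal sketch. This is not the authors' implementation. The detector interface, the hand-set thresholds (the article learns them with a soft-threshold algorithm not shown here), the early-accept rule, and the choice of a (slope, offset) parameter space are all assumptions made for illustration.

```python
from collections import Counter
from typing import Callable, List, Sequence, Tuple

# Hypothetical types: a match pairs a query timestamp with a reference timestamp (seconds).
Match = Tuple[float, float]
# Hypothetical detector interface: returns (similarity score, frame-level matches).
Detector = Callable[[object], Tuple[float, List[Match]]]


def cascade_detect(query, stages: Sequence[Tuple[Detector, float]]):
    """Run detectors in order; stop at the first whose score reaches its threshold.

    Assumption: a stage that reaches its threshold accepts the query as a copy,
    so later (presumably costlier) detectors are skipped.
    """
    for index, (detector, threshold) in enumerate(stages):
        score, matches = detector(query)
        if score >= threshold:
            return index, score, matches
    return None  # no stage accepted the query


def hough_align(matches, slopes=(0.8, 0.9, 1.0, 1.1, 1.25), bin_size=1.0):
    """Vote in a 2D (slope, offset) space for ref_t = slope * query_t + offset.

    Returns (slope, offset, votes) for the most-voted cell, or None if empty.
    The slope candidates and bin size are illustrative values.
    """
    votes = Counter()
    for query_t, ref_t in matches:
        for slope in slopes:
            offset_bin = round((ref_t - slope * query_t) / bin_size)
            votes[(slope, offset_bin)] += 1
    if not votes:
        return None
    (slope, offset_bin), count = votes.most_common(1)[0]
    return slope, offset_bin * bin_size, count


# Usage: four consistent matches (offset 10 s, no speed change) plus one outlier.
demo_matches = [(0, 10), (10, 20), (20, 30), (30, 40), (5, 70)]
weak = lambda q: (0.2, [])            # toy detector that fails its threshold
strong = lambda q: (0.9, demo_matches)  # toy detector that passes
result = cascade_detect("query", [(weak, 0.5), (strong, 0.6)])
alignment = hough_align(result[2])
```

In this toy run the second stage accepts the query, and the voting step recovers the dominant alignment while ignoring the outlier match.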
INDEX TERMS
Video coding, Videos, Threshold analysis, Learning systems, Sequential analysis, Multimedia communication, multiscale sequence matching, TRECVID-CBCD, multimedia, video copy detection, scalable cascading, complementary detectors, soft threshold learning
CITATION
Yonghong Tian, Tiejun Huang, Menglin Jiang, Wen Gao, "Video Copy-Detection and Localization with a Scalable Cascading Framework", IEEE MultiMedia, vol.20, no. 3, pp. 72-86, July-Sept. 2013, doi:10.1109/MMUL.2012.62
REFERENCES
1. W. Kraaij and G. Awad, TRECVID 2011 Content-Based Copy Detection: Task Overview, Nov. 2011; www-nlpir.nist.gov/projects/tvpubs/tv11.slides/tv11.ccd.slides.pdf.
2. Y.H. Tian et al., "A Multimodal Video Copy Detection Approach with Sequential Pyramid Matching," Proc. 18th IEEE Int'l Conf. Image Processing (ICIP), IEEE CS, 2011, pp. 3629–3632.
3. S.K. Wei et al., "Frame Fusion for Video Copy Detection," IEEE Trans. Circuits and Systems for Video Technology, vol. 21, no. 1, 2011, pp. 15–28.
4. P. Viola and M. Jones, "Rapid Object Detection Using a Boosted Cascade of Simple Features," Proc. IEEE Computer Soc. Conf. Computer Vision and Pattern Recognition (CVPR), vol. 1, IEEE CS, 2001, pp. 511–518.
5. J.P. Chen and T.J. Huang, "A Robust Feature Extraction Algorithm for Audio Fingerprinting," Proc. 9th Pacific Rim Conf. Multimedia: Advances in Multimedia Information Processing (PCM), Springer-Verlag, 2008, pp. 887–890.
6. A. Bosch, A. Zisserman, and X. Muñoz, "Scene Classification Using a Hybrid Generative/Discriminative Approach," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 30, no. 4, 2008, pp. 712–727.
7. K. Terasawa and Y. Tanaka, "Spherical LSH for Approximate Nearest Neighbor Search on Unit Hypersphere," Proc. 10th Int'l Conf. Algorithms and Data Structures (WADS 2007), LNCS 4619, Springer-Verlag, 2007, pp. 27–38.
8. B. Liu et al., "Real-Time Video Copy-Location Detection in Large-Scale Repositories," IEEE MultiMedia, vol. 18, no. 3, 2011, pp. 22–31.
9. S. Lazebnik, C. Schmid, and J. Ponce, "Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories," Proc. IEEE Computer Soc. Conf. Computer Vision and Pattern Recognition (CVPR), vol. 2, IEEE CS, 2006, pp. 2169–2178.