The Community for Technology Leaders
Green Image
Issue No. 03 - July-Sept. (2013 vol. 20)
ISSN: 1070-986X
pp: 72-86
Wen Gao , Peking University
Menglin Jiang , Peking University
Tiejun Huang , Peking University
Yonghong Tian , Peking University
ABSTRACT
For video copy detection, no single audio-visual feature, or single detector based on several features, can work well for all transformations. This article proposes a novel video copy-detection and localization approach with scalable cascading of complementary detectors and multiscale sequence matching. In this cascade framework, a soft-threshold learning algorithm is utilized to estimate the optimal decision thresholds for detectors, and a multiscale sequence matching method is employed to precisely locate copies using a 2D Hough transform and multigranularities similarity evaluation. Excellent performance on the TRECVID-CBCD 2011 benchmark dataset shows the effectiveness and efficiency of the proposed approach.
INDEX TERMS
VIdeo coding, Videos, Threshold analysis, Learning systems, Sequential analysis, Multimedia communication, multiscale sequence matching, TRECVID-CBCD, multimedia, video copy detection, scalable cascading, complementary detectors, soft threshold learning
CITATION
Wen Gao, Menglin Jiang, Tiejun Huang, Yonghong Tian, "Video Copy-Detection and Localization with a Scalable Cascading Framework", IEEE MultiMedia, vol. 20, no. , pp. 72-86, July-Sept. 2013, doi:10.1109/MMUL.2012.62
107 ms
(Ver 3.1 (10032016))