Issue No.10 - October (2010 vol.22)
pp: 1372-1387
Xiangmin Zhou, Canberra ICT Center, CSIRO, Australia
Xiaofang Zhou, The University of Queensland, Brisbane
Lei Chen, Hong Kong University of Science and Technology, Hong Kong
Yanfeng Shu, Tasmanian ICT Center, CSIRO, Australia
Athman Bouguettaya, Canberra ICT Center, CSIRO, Australia
John A. Taylor, Canberra CMIS Center, CSIRO, Australia
ABSTRACT
Identifying similar videos efficiently and effectively is an important and nontrivial problem in content-based video retrieval. This paper proposes a subspace symbolization approach, named SUDS, for content-based retrieval on very large video databases. The novelty of SUDS is that it exploits the data distribution in subspaces to build a visual dictionary, with which videos are processed using string matching techniques together with a two-step data simplification. Specifically, we first propose an adaptive approach, called VLP, that extracts a series of dominant subspaces of variable lengths from the whole visual feature space without requiring the dimensions of a subspace to be consecutive. A stable visual dictionary is then built by clustering the video keyframes over each dominant subspace. A compact video representation model is developed by transforming each keyframe into a word, which is a series of symbols, one per dominant subspace, and each video into a series of words. Next, we present a similarity measure called CVE, which adopts a complementary information compensation scheme based on the visual features and the sequence context of videos. Finally, an efficient two-layered index strategy with a number of query optimizations is proposed to facilitate video retrieval. The experimental results demonstrate the high effectiveness and efficiency of SUDS.
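The symbolization idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the subspaces and cluster centroids below are hand-picked toy values rather than the output of VLP or of clustering real keyframes, and plain edit distance is swapped in as a stand-in for the CVE measure, whose definition the abstract does not give. All function and variable names are hypothetical.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]
Word = Tuple[int, ...]  # one symbol (cluster index) per subspace


def nearest_centroid(point: Vector, centroids: Sequence[Vector]) -> int:
    """Return the index of the centroid closest to point (squared Euclidean)."""
    def sq_dist(c: Vector) -> float:
        return sum((p - q) ** 2 for p, q in zip(point, c))
    return min(range(len(centroids)), key=lambda i: sq_dist(centroids[i]))


def keyframe_to_word(frame: Vector,
                     subspaces: Sequence[Sequence[int]],
                     dictionary: Sequence[Sequence[Vector]]) -> Word:
    """Project a keyframe onto each subspace and emit one symbol per subspace."""
    symbols = []
    for dims, centroids in zip(subspaces, dictionary):
        projected = [frame[d] for d in dims]  # dims need not be consecutive
        symbols.append(nearest_centroid(projected, centroids))
    return tuple(symbols)


def video_to_words(frames: Sequence[Vector],
                   subspaces: Sequence[Sequence[int]],
                   dictionary: Sequence[Sequence[Vector]]) -> List[Word]:
    """Represent a video (a sequence of keyframes) as a sequence of words."""
    return [keyframe_to_word(f, subspaces, dictionary) for f in frames]


def edit_distance(a: Sequence[Word], b: Sequence[Word]) -> int:
    """Plain Levenshtein distance over words (assumed stand-in, not CVE)."""
    prev = list(range(len(b) + 1))
    for i, wa in enumerate(a, 1):
        curr = [i]
        for j, wb in enumerate(b, 1):
            cost = 0 if wa == wb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


# Toy setup (assumed values): 4-dimensional features, two subspaces made of
# non-consecutive dimensions, and two centroids per subspace.
subspaces = [(0, 2), (1, 3)]
dictionary = [
    [[0.0, 0.0], [1.0, 1.0]],  # centroids for subspace (0, 2)
    [[0.0, 0.0], [1.0, 1.0]],  # centroids for subspace (1, 3)
]

video_a = [[0.1, 0.9, 0.0, 1.0], [0.9, 0.1, 1.0, 0.0], [0.8, 0.9, 0.9, 1.0]]
video_b = [[0.0, 1.0, 0.1, 0.9], [0.9, 0.8, 1.0, 0.9]]

words_a = video_to_words(video_a, subspaces, dictionary)
words_b = video_to_words(video_b, subspaces, dictionary)

print(words_a)                          # [(0, 1), (1, 0), (1, 1)]
print(words_b)                          # [(0, 1), (1, 1)]
print(edit_distance(words_a, words_b))  # 1
```

In this toy run the two videos differ by one keyframe, so their word sequences are one edit apart. The sketch only shows why a symbolic representation makes string matching applicable; it says nothing about how SUDS selects subspaces, weights mismatches, or indexes the words.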
INDEX TERMS
Video detection, subspace symbolization, variable length partition, query optimization.
CITATION
Xiangmin Zhou, Xiaofang Zhou, Lei Chen, Yanfeng Shu, Athman Bouguettaya, John A. Taylor, "Adaptive Subspace Symbolization for Content-Based Video Detection", IEEE Transactions on Knowledge & Data Engineering, vol.22, no. 10, pp. 1372-1387, October 2010, doi:10.1109/TKDE.2009.171