This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Data Semantics for Improving Retrieval Performance of Digital News Video Systems
May/June 2001 (vol. 13 no. 3)
pp. 352-360

Abstract—We propose a novel four-step hybrid approach for retrieval and composition of video newscasts based on information contained in different metadata sets. In the first step, we use conventional retrieval techniques to isolate video segments from the data universe using segment metadata. In the second step, retrieved segments are clustered into potential news items using a dynamic technique sensitive to the information contained in the segments. In the third step, we apply a transitive search technique to increase the recall of the retrieval system. In the final step, we increase recall performance by identifying segments possessing creation-time relationships. A quantitative analysis of the performance of the process on a newscast composition shows an increase in recall by 59 percent over the conventional keyword-based search technique used in the first step.

[1] G. Ahanger and T.D.C. Little, “The Canvass News Video Composition,” online demonstration,http://hulk.bu.edu/projects/fapdemo.html , 1999.
[2] G. Ahanger and T.D.C. Little, “Automatic Composition Techniques for Video Production,” IEEE Trans. Knowledge and Data Engineering, vol. 10, no. 6, pp. 967-987, Nov./Dec. 1998.
[3] G. Ahanger and T.D.C. Little, “A Language to Support Automatic Composition of Newscasts,” Computing and Information Technology, vol. 6, no. 3, pp. 297-310, 1998.
[4] G. Ahanger and T.D.C Little, “A System for Customized News Delivery from Video Archives,” Proc. Int'l Conf. Multimedia Computing and Systems, pp. 526-533, 1997.
[5] G. Ahanger and T.D.C Little, “A Survey of Technologies for Parsing and Indexing Digital Video,” Visual Comm. and Image Representation, vol. 7, no. 1, pp. 28-43, 1996.
[6] A. Akutsu and Y. Tonomura, "Video Tomography: An Efficient Method for Camerawork Extraction and Motion Analysis," Proc. ACM Multimedia 94, ACM Press, New York, 1994, pp. 349-356.
[7] E. Ardizzone and M. La Casia, “Automatic Video Database Indexing and Retrieval,” Multimedia Tools and Applications, vol. 4, no. 1, pp. 29-56, 1997.
[8] E. Branigan, “Narrative Schema,” Narrative Comprehension and Film, pp. 1-32, 1992.
[9] M.G. Brown, J.T. Foote, G.J.F. Jones, K.S. Jones, and S.J. Young, “Automatic Content-Based Retrieval of Broadcast News,” Proc. ACM Multimedia '95, pp. 35-43, 1995.
[10] C. Buckley, “Implementation of the SMART Information Retrieval System,” Technical Report No. TR85-686, Computer Science Department, Cornell Univ., 1985.
[11] M. Carrer, L. Ligresti, G. Ahanger, and T.D.C. Little, “An Annotation Engine for Supporting Video Database Population,” Multimedia Tools and Applications, vol. 5, no. 3, pp. 233-258, Nov. 1997.
[12] S.-F. Chang, J.R. Smith, M. Beigi, and A. Benitez, “Visual Information Retrieval from Large Distributed Online Repositories,” Comm. ACM, vol. 40, no. 12, pp. 63-71, 1997.
[13] T.H. Cormen,C.E. Leiserson, and R.L. Rivest,Introduction to Algorithms.Cambridge, Mass.: MIT Press/McGraw-Hill, 1990.
[14] D.L. Davies and D.W. Bouldin, “A Cluster Separation Measure,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 1. no. 2, Apr. 1979.
[15] R.O. Duda, P.E. Hart, Pattern Classification and Scene Analysis. John Wiley&Sons, 1973.
[16] J. Hafner, H.S. Sawhney, W. Equitz, M. Flickner, and W. Niblack, “Efficient Color Histogram Indexing for Quadratic Form Distance Functions,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 17, no. 7, pp. 729-736, July 1995.
[17] J.A Hartigan,Clustering Algorithms, John Wiley and Sons, New York, N.Y., 1975.
[18] W. Klippgen, T.D.C. Little, G. Ahanger, and D. Venkatesh, “The Use of Metadata for the Rendering of Personalized Video Delivery,” Multimedia Data Management: Using Metadata to Integrate and Apply Digital Media, A. Sheth and W. Klas, eds., pp. 287-318, 1998.
[19] R. Lienhart, S. Pfeiffer, and W. Effelsberg, "Video Abstracting," Comm. ACM, Vol. 40, No. 12, 1997, pp. 54-62.
[20] V.E. Ogle and M. Stonebreaker, “Chabot: Retrieval from a Relational Database of Images,” Computer, vol. 28, no. 2, pp. 49-56, 1995.
[21] R.W. Picard and T.P. Minka, “Vision Texture for Annotation,” Multimedia Systems, vol. 3, pp. 3-14, 1995.
[22] G. Salton and M. McGill, Introduction to Modern Information Retrieval, McGraw Hill, New York, 1983.
[23] S. Santini and R. Jain, “Similarity is a Geometer,” Multimedia Tools and Applications, vol. 5, no. 3, pp. 277-306, 1997.
[24] S. Sclaroff and J. Isidoro, “Active Blobs,” Proc. Sixth Int'l Conf. Computer Vision, pp. 1146-1153, 1998.
[25] J.S. Wachman, “A Video Browser that Learns by Example,” master's thesis, Technical Report #383, MIT Media Laboratory, Cambridge, Mass., 1997.
[26] H. Wactlar, T. Kanade, M.A. Smith, and S.M. Stevens, “Intelligent Access to Digital Video: The Informedia Project,” Computer, vol. 29, no. 5, pp. 46-52, 1996.

Index Terms:
News video composition, retrieval, content metadata, structural metadata, unstructured metadata, keyword vector, recall, precision.
Citation:
Gulrukh Ahanger, Thomas D.C. Little, "Data Semantics for Improving Retrieval Performance of Digital News Video Systems," IEEE Transactions on Knowledge and Data Engineering, vol. 13, no. 3, pp. 352-360, May-June 2001, doi:10.1109/69.929894
Usage of this product signifies your acceptance of the Terms of Use.