2001 IEEE International Conference on Multimedia and Expo (ICME'01)
SEMANTIC BASED RETRIEVAL MODEL FOR DIGITAL AUDIO AND VIDEO
Tokyo, Japan
August 22-August 25
ISBN: 0-7695-1198-8
Surya Nepal, CSIRO Mathematical and Information Sciences, Locked Bag 17, North Ryde NSW 1670, Australia
Uma Srinivasan, CSIRO Mathematical and Information Sciences, Locked Bag 17, North Ryde NSW 1670, Australia
Graham Reynolds, CSIRO Mathematical and Information Sciences, Locked Bag 17, North Ryde NSW 1670, Australia
Recent content-based retrieval systems such as QBIC [7] and VisualSEEk [8] use low-level audio-visual features such as color, pan, zoom, and loudness for retrieval. However, users prefer to retrieve videos using high-level semantics based on their own perception, such as "bright color" and "very loud sound". This leaves a gap between what users want and what systems can provide. This paper attempts to bridge that gap by mapping users' perception of semantic concepts to low-level feature values. It proposes a model that provides high-level semantics for an audio feature that determines loudness. We first perform a pilot user study to capture users' perception of loudness level on a collection of sound-effect audio clips, and map the perceived levels to five semantic terms. We then describe how the loudness measure in MPEG-1 Layer II audio files can be mapped to user-perceived loudness. Finally, we devise a fuzzy technique for retrieving audio/video clips from the collections using those semantic terms.
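As an illustration of the fuzzy retrieval idea described in the abstract, the following minimal sketch scores clips against a semantic loudness term. It is not the authors' model: the five term names, the normalized [0, 1] loudness scale, the triangular membership functions, and the 0.5 threshold are all assumptions made for this example, since the abstract does not specify them.

```python
from typing import Dict, List, Tuple

# Hypothetical term names and peak positions on an assumed normalized
# [0, 1] loudness scale; the abstract does not list the actual five terms.
TERM_PEAKS: Dict[str, float] = {
    "very quiet": 0.0,
    "quiet": 0.25,
    "medium": 0.5,
    "loud": 0.75,
    "very loud": 1.0,
}

# Assumed shape: each triangle falls to zero at the neighbouring term's peak.
HALF_WIDTH = 0.25


def membership(loudness: float, term: str) -> float:
    """Triangular membership degree of a normalized loudness value in a term."""
    peak = TERM_PEAKS[term]
    return max(0.0, 1.0 - abs(loudness - peak) / HALF_WIDTH)


def retrieve(
    clips: Dict[str, float], term: str, threshold: float = 0.5
) -> List[Tuple[str, float]]:
    """Return (clip name, degree) pairs meeting the threshold, best match first."""
    scored = [(name, membership(value, term)) for name, value in clips.items()]
    hits = [(name, degree) for name, degree in scored if degree >= threshold]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)
```

With a made-up collection such as `{"thunder": 0.95, "door": 0.8, "whisper": 0.1}`, a query for "very loud" would return only the "thunder" clip, because the other clips fall below the assumed threshold for that term.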
Citation:
Surya Nepal, Uma Srinivasan, Graham Reynolds, "SEMANTIC BASED RETRIEVAL MODEL FOR DIGITAL AUDIO AND VIDEO," icme, pp.285, 2001 IEEE International Conference on Multimedia and Expo (ICME'01), 2001