|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Acoustics, Speech, and Signal Processing, 1999. Proceedings. Vol 6, 1999 IEEE International Conference on
Video content extraction and representation using a joint audio and video processing
Phoenix, AZ, USA
March 15-March 19
ISBN: 0-7803-5041-3
| ASCII Text | x | ||
| C. Saraceno, "Video content extraction and representation using a joint audio and video processing," Acoustics, Speech, and Signal Processing, IEEE International Conference on, vol. 6, pp. 3033-3036, Acoustics, Speech, and Signal Processing, 1999. Proceedings. Vol 6, 1999 IEEE International Conference on, 1999. | |||
| BibTex | x | ||
| @article{ 10.1109/ICASSP.1999.757480, author = {C. Saraceno}, title = {Video content extraction and representation using a joint audio and video processing}, journal ={Acoustics, Speech, and Signal Processing, IEEE International Conference on}, volume = {6}, year = {1999}, isbn = {0-7803-5041-3}, pages = {3033-3036}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICASSP.1999.757480}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Acoustics, Speech, and Signal Processing, IEEE International Conference on TI - Video content extraction and representation using a joint audio and video processing SN - 0-7803-5041-3 SP3033 EP3036 A1 - C. Saraceno, PY - 1999 VL - 6 JA - Acoustics, Speech, and Signal Processing, IEEE International Conference on ER - | |||
Computer technology allows for large collections of digital archived material. At the same time, the increasing availability of potentially interesting data makes difficult the retrieval of desired information. Currently, access to such information is limited to textual queries or characteristics such as color or texture. The demand for new solutions allowing common users to easily access, store and retrieve relevant audio-visual information is becoming urgent. One possible solution to this problem is to hierarchically organize the audio-visual data so as to create a nested indexing structure which provides efficient access to relevant information at each level of the hierarchy. This work presents an automatic methodology to extract and hierarchically represent the semantics of the contents, based on a joint audio and visual analysis. Descriptions on each media (audio, video) are used to recognize higher level of meaningful structures, such as specific types of scenes, or, at the highest level, correlations beyond the temporal organization of information, allowing it to reflect classes of visual or audio or audio-visual types. Once a hierarchy is extracted from the data analysis, a nested indexing structure can be created to access relevant information at a specific level of detail, according to the user requirements.
Citation:
C. Saraceno, "Video content extraction and representation using a joint audio and video processing," icassp, vol. 6, pp.3033-3036, Acoustics, Speech, and Signal Processing, 1999. Proceedings. Vol 6, 1999 IEEE International Conference on, 1999
Usage of this product signifies your acceptance of the Terms of Use.
