ACS/IEEE 2005 International Conference on Computer Systems and Applications (AICCSA'05)
Spoken information retrieval for multimedia databases
Cairo, Egypt
January 03-January 06
ISBN: 0-7803-8735-X
Summary form only given. This document describes the realization of a spoken information retrieval system and its application to word search in indexed multimedia databases. The multimedia database is built from a multi-format set of text, audio and video documents. The whole archive collection is indexed by first preprocessing the documents to produce transcripts and then cataloging those transcripts with indexing software tools. The system uses a Java-based distributed client-server architecture. A Java applet captures the audio signal of a spoken query and transmits it to a server, where automatic speech recognition (ASR) software converts the signal into a transcribed hypothesis. A query tool then matches the transcribed sentence against the indexed multimedia database and generates a set of pointers to documents. Finally, a Web page with links to the resulting documents, in which the queried words appear, is presented to the user.
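The query step described above can be illustrated with a minimal sketch. It is not the authors' implementation (the abstract describes a Java-based system and does not name its indexing tools). It assumes an in-memory inverted index over the transcripts and treats a document as a match when any word of the recognized hypothesis appears in it. The function names, file names and sample transcripts are all hypothetical.

```python
from collections import defaultdict

PUNCTUATION = ".,;:!?\"'()"


def tokenize(text):
    """Lowercase the text and split it into words, dropping surrounding punctuation."""
    words = (token.strip(PUNCTUATION) for token in text.lower().split())
    return [word for word in words if word]


def build_index(transcripts):
    """Map each word to the set of document ids whose transcript contains it."""
    index = defaultdict(set)
    for doc_id, text in transcripts.items():
        for word in tokenize(text):
            index[word].add(doc_id)
    return index


def search(index, hypothesis):
    """Return the sorted ids of documents containing any word of the ASR hypothesis."""
    hits = set()
    for word in tokenize(hypothesis):
        # .get() avoids creating empty entries for unknown words.
        hits |= index.get(word, set())
    return sorted(hits)


# Hypothetical transcripts standing in for text, audio and video documents.
transcripts = {
    "report.txt": "Speech recognition converts audio into text.",
    "lecture.wav": "The lecture covers indexing of video archives.",
    "clip.mpg": "Audio and video documents are indexed together.",
}

index = build_index(transcripts)
print(search(index, "audio indexing"))
```

In a deployment like the one described, the returned identifiers would be rendered as links on the results Web page; matching on any query word is an assumption here, and a stricter variant could intersect the per-word sets instead.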
Citation:
L.R. Salgado-Garza, J.A. Nolazco-Flores, P.D. Diaz-Lopez, "Spoken information retrieval for multimedia databases," aiccsa, pp.146-vii, ACS/IEEE 2005 International Conference on Computer Systems and Applications (AICCSA'05), 2005