|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2005 IEEE International Conference on Multimedia and Expo
Speech Acquisition in Meetings with an Audio-Visual Sensor Array
Amsterdam, Netherlands
July 06-July 06
ISBN: 0-7803-9331-7
| ASCII Text | x | ||
| I. McCowan, M.H. Krishna, D. Gatica-Perez, D. Moore, null Sileye Ba, "Speech Acquisition in Meetings with an Audio-Visual Sensor Array," 2012 IEEE International Conference on Multimedia and Expo, pp. 1382-1385, 2005 IEEE International Conference on Multimedia and Expo, 2005. | |||
| BibTex | x | ||
| @article{ 10.1109/ICME.2005.1521688, author = {I. McCowan and M.H. Krishna and D. Gatica-Perez and D. Moore and null Sileye Ba}, title = {Speech Acquisition in Meetings with an Audio-Visual Sensor Array}, journal ={2012 IEEE International Conference on Multimedia and Expo}, volume = {0}, year = {2005}, isbn = {0-7803-9331-7}, pages = {1382-1385}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICME.2005.1521688}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - 2012 IEEE International Conference on Multimedia and Expo TI - Speech Acquisition in Meetings with an Audio-Visual Sensor Array SN - 0-7803-9331-7 SP1382 EP1385 A1 - I. McCowan, A1 - M.H. Krishna, A1 - D. Gatica-Perez, A1 - D. Moore, A1 - null Sileye Ba, PY - 2005 KW - null VL - 0 JA - 2012 IEEE International Conference on Multimedia and Expo ER - | |||
Close-talk headset microphones have been traditionally used for speech acquisition in a number of applications, as they naturally provide a higher signal-to-noise ratio - needed for recognition tasks than single distant microphones. However, in multi-party conversational settings like meetings, microphone arrays represent an important alternative to close-talking microphones, as they allow for localisation and tracking of speakers and signal-independent enhancement, while providing a non-intrusive, hands-free operation mode. In this article, we investigate the use of an audio-visual sensor array, composed of a small table-top microphone array and a set of cameras, for speaker tracking and speech enhancement in meetings. Our methodology first fuses audio and video for person tracking, and then integrates the output of the tracker with a beamformer for speech enhancement. We compare and discuss the features of the resulting speech signal with respect to that obtained from single close-talking and table-top microphones.
Citation:
I. McCowan, M.H. Krishna, D. Gatica-Perez, D. Moore, null Sileye Ba, "Speech Acquisition in Meetings with an Audio-Visual Sensor Array," icme, pp.1382-1385, 2005 IEEE International Conference on Multimedia and Expo, 2005
Usage of this product signifies your acceptance of the Terms of Use.
