|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02)
Multi-Modal Translation System and Its Evaluation
Pittsburgh, Pennsylvania
October 14-October 16
ISBN: 0-7695-1834-6
| ASCII Text | x | ||
| Shigeo Morishima, Satoshi Nakamura, "Multi-Modal Translation System and Its Evaluation," Multimodal Interfaces, IEEE International Conference on, pp. 241, Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02), 2002. | |||
| BibTex | x | ||
| @article{ 10.1109/ICMI.2002.1167000, author = {Shigeo Morishima and Satoshi Nakamura}, title = {Multi-Modal Translation System and Its Evaluation}, journal ={Multimodal Interfaces, IEEE International Conference on}, volume = {0}, year = {2002}, isbn = {0-7695-1834-6}, pages = {241}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICMI.2002.1167000}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Multimodal Interfaces, IEEE International Conference on TI - Multi-Modal Translation System and Its Evaluation SN - 0-7695-1834-6 SP EP A1 - Shigeo Morishima, A1 - Satoshi Nakamura, PY - 2002 KW - null VL - 0 JA - Multimodal Interfaces, IEEE International Conference on ER - | |||
Speech-to-speech translation has been studied to realize natural human communication beyond language barriers. Toward further multi-modal natural communication, visual information such as face and lip movements will be necessary. In this paper, we introduce a multi-modal English-to-Japanese and Japanese-to-English translation system that also translates the speaker's speech motion while synchronizing it to the translated speech. To retain the speaker's facial expression, we substitute only the speech organ's image with the synthesized one, which is made by a three-dimensional wire-frame model that is adaptable to any speaker. Our approach enables image synthesis and translation with an extremely small database. We conduct subjective evaluation tests using the connected digit discrimination test using data with and without audio-visual lip-synchronization. The results confirm the significant quality of the proposed audio-visual translation system and the importance of lip-synchronization.
Citation:
Shigeo Morishima, Satoshi Nakamura, "Multi-Modal Translation System and Its Evaluation," icmi, pp.241, Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02), 2002
Usage of this product signifies your acceptance of the Terms of Use.
