2001 IEEE International Conference on Multimedia and Expo (ICME'01) Model-Based Lip Synchronization With Automatically Translated Systhetic Voice Toward A Multi-Modal Translation System Tokyo, Japan August 22-August 25 ISBN: 0-7695-1198-8
In this paper, we introduce a multi-modal English-to-Japanese and Japanese-to-English translation system that also translates the speaker's speech motion while synchronizing it to the translated speech. To retain the speaker's facial expression, we substitute only the speech organ's image with the synthesized one, which is made by a three-dimensional wire-frame model that is adaptable to any speaker. Our approach enables image synthesis and translation with an extremely small database.
Citation:
Shin OGATA, Kazumasa MURAI, Satoshi NAKAMURA, Shigeo MORISHIMA, "Model-Based Lip Synchronization With Automatically Translated Systhetic Voice Toward A Multi-Modal Translation System," icme, pp.8, 2001 IEEE International Conference on Multimedia and Expo (ICME'01), 2001 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||