|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
A Statistical Quality Model for Data-Driven Speech Animation
Nov. 2012 (vol. 18 no. 11)
pp. 1915-1927
| ASCII Text | x | ||
| Xiaohan Ma, Zhigang Deng, "A Statistical Quality Model for Data-Driven Speech Animation," IEEE Transactions on Visualization and Computer Graphics, vol. 18, no. 11, pp. 1915-1927, Nov., 2012. | |||
| BibTex | x | ||
| @article{ 10.1109/TVCG.2012.67, author = { Xiaohan Ma and Zhigang Deng}, title = {A Statistical Quality Model for Data-Driven Speech Animation}, journal ={IEEE Transactions on Visualization and Computer Graphics}, volume = {18}, number = {11}, issn = {1077-2626}, year = {2012}, pages = {1915-1927}, doi = {http://doi.ieeecomputersociety.org/10.1109/TVCG.2012.67}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE Transactions on Visualization and Computer Graphics TI - A Statistical Quality Model for Data-Driven Speech Animation IS - 11 SN - 1077-2626 SP1915 EP1927 EPD - 1915-1927 A1 - Xiaohan Ma, A1 - Zhigang Deng, PY - 2012 KW - speech synthesis KW - computer animation KW - regression analysis KW - speech processing KW - interactive talking avatar applications KW - statistical quality model KW - data-driven speech animation approach KW - animation quality KW - SAQP KW - novel statistical model KW - on-the-fly synthesized speech animations KW - data-driven techniques KW - speech animation trajectory fitting metric KW - SATF KW - statistical regression model KW - Animation KW - Speech KW - Trajectory KW - Measurement KW - Principal component analysis KW - Predictive models KW - Face KW - statistical models KW - Facial animation KW - data-driven KW - visual speech animation KW - lip-sync KW - quality prediction VL - 18 JA - IEEE Transactions on Visualization and Computer Graphics ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TVCG.2012.67
Web Extra: View Supplemental Material(AVI)
In recent years, data-driven speech animation approaches have achieved significant successes in terms of animation quality. However, how to automatically evaluate the realism of novel synthesized speech animations has been an important yet unsolved research problem. In this paper, we propose a novel statistical model (called SAQP) to automatically predict the quality of on-the-fly synthesized speech animations by various data-driven techniques. Its essential idea is to construct a phoneme-based, Speech Animation Trajectory Fitting (SATF) metric to describe speech animation synthesis errors and then build a statistical regression model to learn the association between the obtained SATF metric and the objective speech animation synthesis quality. Through delicately designed user studies, we evaluate the effectiveness and robustness of the proposed SAQP model. To the best of our knowledge, this work is the first-of-its-kind, quantitative quality model for data-driven speech animation. We believe it is the important first step to remove a critical technical barrier for applying data-driven speech animation techniques to numerous online or interactive talking avatar applications.
Index Terms:
speech synthesis,computer animation,regression analysis,speech processing,interactive talking avatar applications,statistical quality model,data-driven speech animation approach,animation quality,SAQP,novel statistical model,on-the-fly synthesized speech animations,data-driven techniques,speech animation trajectory fitting metric,SATF,statistical regression model,Animation,Speech,Trajectory,Measurement,Principal component analysis,Predictive models,Face,statistical models,Facial animation,data-driven,visual speech animation,lip-sync,quality prediction
Citation:
Xiaohan Ma, Zhigang Deng, "A Statistical Quality Model for Data-Driven Speech Animation," IEEE Transactions on Visualization and Computer Graphics, vol. 18, no. 11, pp. 1915-1927, Nov. 2012, doi:10.1109/TVCG.2012.67
Usage of this product signifies your acceptance of the Terms of Use.

