The Community for Technology Leaders
2006 IEEE International Conference on Multimedia and Expo (2006)
Toronto, ON, Canada
July 9, 2006 to July 12, 2006
ISBN: 1-4244-0366-7
pp: 1653-1656
Mingyu You , College of Computer Science, YuQuan Campus, ZheJiang University, Hangzhou, CHINA, 310027. roseyoumy@zju.edu.cn
Chun Chen , College of Computer Science, YuQuan Campus, ZheJiang University, Hangzhou, CHINA, 310027. chenc@zju.edu.cn
Jiajun Bu , College of Computer Science, YuQuan Campus, ZheJiang University, Hangzhou, CHINA, 310027. bjj@zju.edu.cn
Jia Liu , College of Computer Science, YuQuan Campus, ZheJiang University, Hangzhou, CHINA, 310027. liujia@zju.edu.cn
Jianhua Tao , National Laboratory of Pattern Recognition, Chinese Academy of Sciences, Beijing, CHINA, 100080. jhtao@nlpr.ia.ac.cn
ABSTRACT
This paper presents an emotion recognition system from clean and noisy speech. Geodesic distance was adopted to preserve the intrinsic geometry of emotional speech. Based on the geodesic distance estimation, an enhanced Lipschitz embedding was developed to embed the 64-dimensional acoustic features into a six-dimensional space. In order to avoid the problems brought by noise reduction, emotion recognition from noisy speech was performed directly. Linear Discriminant Analysis (LDA), Principal Component Analysis (PCA) and feature selection by Sequential Forward Selection (SFS) with Support Vector Machine (SVM) were also included to compress acoustic features before classifying the emotional states of clean and noisy speech. Experimental results demonstrate that compared with other methods, the proposed system makes approximately 10% improvement. The performance of our system is also robust when speech data is corrupted by increasing noise.
INDEX TERMS
null
CITATION

J. Tao, M. You, C. Chen, J. Liu and J. Bu, "Emotion Recognition from Noisy Speech," 2006 IEEE International Conference on Multimedia and Expo(ICME), Toronto, ON, Canada, 2006, pp. 1653-1656.
doi:10.1109/ICME.2006.262865
96 ms
(Ver 3.3 (11022016))