IEEE-INNS-ENNS International Joint Conference on Neural Networks (IJCNN'00)-Volume 5
Capture Inter-Speaker Information with a Neural Network for Speaker Identification
Como, Italy
July 24-July 27
ISBN: 0-7695-0619-4
Many speaker identification systems are created by model-based approaches, where a statistical model is used to characterize speaker's voice and no inter-speaker information is used in parameter estimation. It is well known that inter-speaker information is very helpful in discrimination of different speakers. In this paper, we propose a novel method for the use of inter-speaker information to improve performance of a model-based speaker identification system. A neural network is employed to capture inter-speaker information from output space of those statistical models. In order to sufficiently utilize inter-speaker information, a rival penalized encoding rule is proposed to design supervised learning pairs for training the neural network. Comparative results in the KING speech corpus show that our method leads to a considerable improvement for a model-based speaker identification system.
Citation:
Lan Wang, Ke Chen, Huisheng Chi, "Capture Inter-Speaker Information with a Neural Network for Speaker Identification," ijcnn, vol. 5, pp.5247, IEEE-INNS-ENNS International Joint Conference on Neural Networks (IJCNN'00)-Volume 5, 2000