Unsupervised Speaker Change Detection Using SVM Training Misclassification Rate September 2007 (vol. 56 no. 9) pp. 1234-1244
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TC.2007.70746
Abstract—This work presents an unsupervised speaker change detection algorithm based on support vector machine (SVM) to detect speaker change in a speech stream. The proposed algorithm is called the SVM training misclassification rate (STMR). The STMR can identify speaker changes with less speech data collection, making it capable of detecting speaker segments with short duration. According to experiments on the NIST Rich Transcription 2005 Spring Evaluation (RT-05S) corpus, the STMR has a missed detection rate of only 19.67%.
Index Terms:
Speaker segmentation, Support Vector Machine, Speaker Change Detection
Citation:
Po-Chuan Lin, Jia-Ching Wang, Jhing-Fa Wang, Hao-Ching Sung, "Unsupervised Speaker Change Detection Using SVM Training Misclassification Rate," IEEE Transactions on Computers, vol. 56, no. 9, pp. 1234-1244, June 2007, doi:10.1109/TC.2007.70746 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||