Circuits, Communications and Systems, Pacific-Asia Conference on (2009)
May 16, 2009 to May 17, 2009
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/PACCS.2009.72
With the development of speech recognition, speech data mining becomes a hot topic in fields of data mining and natural language processing. In this paper, a novel clustering algorithm is presented to describe how to do semantic mining and how to understand the developing trend of event implied in speech sequence. At first, the speech sequences are extracted into a Baysian network presenting the relationship between different speech elements. Then, we utilize a 3-dimensional space and sequence cluster techniques to excavate implied information from speech. Considering speech data features, we improve traditional distance-based clustering algorithm to get semantic information and enhance performance. The experimental results show that our algorithm is correct and effective.
Speech data mining, Sequence cluster, Frequent sequence, Baysian Network
F. Zhao, D. Wu, P. Yuan and H. Jin, "A Novel Clustering Algorithm for Mining Speech Data Using Baysian Network-Based Mutliple Model," 2009 Pacific-Asia Conference on Circuits, Communications and Systems (PACCS 2009)(PACCS), Chengdu, 2009, pp. 617-620.