This Article 
 Bibliographic References 
 Add to: 
A Computationally Compact Divergence Measure for Speech Processing
December 1991 (vol. 13 no. 12)
pp. 1255-1260

The directed divergence, which is a measure based on the discrimination information between two signal classes, is investigated. A simplified expression for computing the directed divergence is derived for comparing two Gaussian autoregressive processes such as those found in speech. This expression alleviates both the computational cost (reduced by two thirds) and the numerical problems encountered in computing the directed divergence. In addition, the simplified expression is compared with the Itakura-Saito distance (which asymptotically approaches the directed divergence). Although the expressions for these two distances closely resemble each other, only moderate correlations between the two were found on a set of actual speech data.

[1] S. Kullback,Information Theory and Statistics. New York: Dover, 1968.
[2] I. Csiszar, "I-divergence geometry of probability distributions and minimization problems,"Ann. Prob., vol. 3, pp. 146-158, 1975.
[3] T. Kailath, "The divegence and Bhattacharyya distance measures in signal detection,"IEEE Trans. Comm. Tech., vol. COM-15, pp. 52-60, Feb. 1967.
[4] R. Johnson, "Axiomatic characterization of the directed divergences and their linear combinations,"IEEE Trans. Inform. Theory, vol. IT-25, pp. 709-716, Nov. 1979.
[5] Y. Linde, A. Buzo, and R. M. Gray, "An algorithm for vector quantizer design,"IEEE Trans. Commun., vol. COM-28, pp. 84-95, Jan. 1980.
[6] J. Shore, "Minimum cross-entropy spectral analysis,"IEEE Trans. Acoust. Speech Signal Processing, vol. ASSP-29, pp. 230-237, Apr. 1981.
[7] R.M. Gray, A. Buzo, A.H. Gray, and Y. Matsuyama, "Distortion measures for speech processing,"IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 367-376, Aug. 1980.
[8] W. Gersch, F. Martinelli, J. Yonemoto, M. Low, and J. McEwan, "Automatic classification of electroencephalograms: Kullback-Leibler nearest neighbor rules,"Sci., vol. 34, pp. 193-195, July 1979.
[9] F. Itakura and S. Saito, "An analysis-synthesis telephony based on maximum likelihood method," inProc. Int. Cong. Acoust., 1968, vol. C-5-5, pp. C17-20.
[10] R. M. Gray, A. H. Gray Jr., G. Rebolledo, and J.E. Shore, "Rate-distortion speech coding with a minimum discrimination information distortion measure,"IEEE Trans. Inform. Theory, vol. IT-27, pp. 708-721, Nov. 1981.
[11] A. H. Gray, Jr. and J. D. Markel, "Distance measures for speech processing,"IEEE Trans Acoust. Speech Signal Processing, vol. ASSP- 24, pp. 380-391, Oct. 1976.
[12] G. Golub and C. Van Loan,Matrix Computations. Baltimore, MD: John Hopkins University Press, 1983.
[13] S. Marple, Jr.,Digital Spectral Analysis with Applications. Englewood Cliffs, NJ: Prentice-Hall, 1987.
[14] S. Levinson, L. Rabiner, A. Rosenberg, and J. Wilson, "Interactive clustering techniques for selecting speaker-independent reference templates for isolated word recognition,"IEEE Trans. Commun., vol. COM-28, pp. 84-95, Jan. 1980.

Index Terms:
computationally compact divergence measure; speech processing; discrimination information; signal classes; Gaussian autoregressive processes; Itakura-Saito distance; correlation methods; matrix algebra; speech analysis and processing
B.A. Carlson, M.A. Clements, "A Computationally Compact Divergence Measure for Speech Processing," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 13, no. 12, pp. 1255-1260, Dec. 1991, doi:10.1109/34.106999
Usage of this product signifies your acceptance of the Terms of Use.