A Speech Understanding and Dialog System with a Homogeneous Linguistic Knowledge Base
February 1994 (vol. 16 no. 2)
pp. 179-194

This article presents EVAR, a speech understanding and dialog system. All levels of linguistic knowledge are used both to control the analysis process and to interpret an utterance, and all kinds of knowledge are integrated in a homogeneous knowledge base. The control algorithm used for the analysis is defined within the representation scheme and does not depend on the application. One aim of EVAR is to develop a system structure in which linguistic and nonlinguistic expectations can be used not only for interpretation but also as predictions for the recognition process.

Index Terms:
speech recognition; speech analysis and processing; knowledge based systems; speech understanding system; dialog system; homogeneous linguistic knowledge base; EVAR
Citation:
M. Mast, F. Kummert, U. Ehrlich, G.A. Fink, T. Kuhn, H. Niemann, G. Sagerer, "A Speech Understanding and Dialog System with a Homogeneous Linguistic Knowledge Base," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 16, no. 2, pp. 179-194, Feb. 1994, doi:10.1109/34.273733