Skip to the main content

Original scientific paper

Language Identification in the Context of Automatic Speech Understanding

E. Noth ; Lehrstuhl fur Mustererkennung (Informatik 5), Universitat Erlangen-Nurnberg, Germany
S. Harbeck ; Lehrstuhl fur Mustererkennung (Informatik 5), Universitat Erlangen-Nurnberg, Germany
H. Niemann ; Lehrstuhl fur Mustererkennung (Informatik 5), Universitat Erlangen-Nurnberg, Germany
V. Warnke ; Lehrstuhl fur Mustererkennung (Informatik 5), Universitat Erlangen-Nurnberg, Germany
I. Ipšić ; Laboratory for Artificial Perception, Faculty of Electrical and Computer Engineering, Ljubljana, Slovenia


Full text: english pdf 3.661 Kb

page 1-8

downloads: 288

cite


Abstract

We present two concepts for systems with language identification in the context of multilingual information retrieval dialogues. The first one has an explicit module for language identification. It is based on training a codebook for each language, running the language specific vector quantizers in parallel and integrating over the output probability of the best alternative in each language. The system can decide for one language either after a predefined time interval or if the difference between the probabilities of the languages succeeds a certain threshold . T his approach allows to recognize languages that the system cannot process and give out a prerecorded message in that language. In the second approach, the trained recognizers of the languages to be recognized, the lexicons, and the language models are combined to one multilingual recognizer. Only allowing transitions between the words from one language, each hypothesized word chain contains only words from one language and language identification is an implicit byproduct of the speech recognizer. First results for the explicit language identification are presented.

Keywords

language identification; speech understanding; multilingual information; retrieval dialogues

Hrčak ID:

150301

URI

https://hrcak.srce.hr/150301

Publication date:

30.3.1996.

Visits: 905 *