Skoči na glavni sadržaj

Izvorni znanstveni članak

Language Identification in the Context of Automatic Speech Understanding

E. Noth ; Lehrstuhl fur Mustererkennung (Informatik 5), Universitat Erlangen-Nurnberg, Germany
S. Harbeck ; Lehrstuhl fur Mustererkennung (Informatik 5), Universitat Erlangen-Nurnberg, Germany
H. Niemann ; Lehrstuhl fur Mustererkennung (Informatik 5), Universitat Erlangen-Nurnberg, Germany
V. Warnke ; Lehrstuhl fur Mustererkennung (Informatik 5), Universitat Erlangen-Nurnberg, Germany
I. Ipšić ; Laboratory for Artificial Perception, Faculty of Electrical and Computer Engineering, Ljubljana, Slovenia


Puni tekst: engleski pdf 3.661 Kb

str. 1-8

preuzimanja: 222

citiraj


Sažetak

We present two concepts for systems with language identification in the context of multilingual information retrieval dialogues. The first one has an explicit module for language identification. It is based on training a codebook for each language, running the language specific vector quantizers in parallel and integrating over the output probability of the best alternative in each language. The system can decide for one language either after a predefined time interval or if the difference between the probabilities of the languages succeeds a certain threshold . T his approach allows to recognize languages that the system cannot process and give out a prerecorded message in that language. In the second approach, the trained recognizers of the languages to be recognized, the lexicons, and the language models are combined to one multilingual recognizer. Only allowing transitions between the words from one language, each hypothesized word chain contains only words from one language and language identification is an implicit byproduct of the speech recognizer. First results for the explicit language identification are presented.

Ključne riječi

language identification; speech understanding; multilingual information; retrieval dialogues

Hrčak ID:

150301

URI

https://hrcak.srce.hr/150301

Datum izdavanja:

30.3.1996.

Posjeta: 544 *