Skip to the main content

Original scientific paper

Development of Acoustic Model for Croatian Language Using HTK

Branimir Dropuljić orcid id orcid.org/0000-0001-5748-2643 ; Department of Electric Machines, Drives and Automation, Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia
Davor Petrinović orcid id orcid.org/0000-0003-3950-7864 ; Department of Electronic Systems and Information Processing, Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia


Full text: english pdf 207 Kb

page 79-88

downloads: 1.544

cite


Abstract

Paper presents development of the acoustic model for Croatian language for automatic speech recognition (ASR). Continuous speech recognition is performed by means of the Hidden Markov Models (HMM) implemented in the HMM Toolkit (HTK). In order to adjust the HTK to the native language a novel algorithm for Croatian language transcription (CLT) has been developed. It is based on phonetic assimilation rules that are applied within uttered words. Phonetic questions for state tying of different triphone models have also been developed. The automated system for training and evaluation of acoustic models has been developed and integrated with the new graphical user interface (GUI). Targeted applications of this ASR system are stress inoculation training (SIT) and virtual reality exposure therapy (VRET). Adaptability of the model to a closed set of speakers is important for such applications and this paper investigates the applicability of the HTK tool for typical scenarios. Robustness of the tool to a new language was tested in matched conditions by a parallel training of an English model that was used as a baseline. Ten native Croatian speakers participated in experiments. Encouraging results were achieved and reported with the developed model for Croatian language.

Keywords

Acoustic model; Automatic speech recognition; Croatian language; Hidden Markov models; Phonetic assimilation; Phonetic transcription algorithm; Recognition accuracy

Hrčak ID:

51368

URI

https://hrcak.srce.hr/51368

Publication date:

22.3.2010.

Article data in other languages: croatian

Visits: 3.323 *