hrcak mascot   Srce   HID

Izvorni znanstveni članak

Development of Acoustic Model for Croatian Language Using HTK

Branimir Dropuljić   ORCID icon orcid.org/0000-0001-5748-2643 ; Department of Electric Machines, Drives and Automation, Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia
Davor Petrinović ; Department of Electronic Systems and Information Processing, Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia

Puni tekst: engleski, pdf (207 KB) str. 79-88 preuzimanja: 972* citiraj
APA 6th Edition
Dropuljić, B. i Petrinović, D. (2010). Development of Acoustic Model for Croatian Language Using HTK. Automatika, 51 (1), 79-88. Preuzeto s https://hrcak.srce.hr/51368
MLA 8th Edition
Dropuljić, Branimir i Davor Petrinović. "Development of Acoustic Model for Croatian Language Using HTK." Automatika, vol. 51, br. 1, 2010, str. 79-88. https://hrcak.srce.hr/51368. Citirano 25.06.2019.
Chicago 17th Edition
Dropuljić, Branimir i Davor Petrinović. "Development of Acoustic Model for Croatian Language Using HTK." Automatika 51, br. 1 (2010): 79-88. https://hrcak.srce.hr/51368
Harvard
Dropuljić, B., i Petrinović, D. (2010). 'Development of Acoustic Model for Croatian Language Using HTK', Automatika, 51(1), str. 79-88. Preuzeto s: https://hrcak.srce.hr/51368 (Datum pristupa: 25.06.2019.)
Vancouver
Dropuljić B, Petrinović D. Development of Acoustic Model for Croatian Language Using HTK. Automatika [Internet]. 2010 [pristupljeno 25.06.2019.];51(1):79-88. Dostupno na: https://hrcak.srce.hr/51368
IEEE
B. Dropuljić i D. Petrinović, "Development of Acoustic Model for Croatian Language Using HTK", Automatika, vol.51, br. 1, str. 79-88, 2010. [Online]. Dostupno na: https://hrcak.srce.hr/51368. [Citirano: 25.06.2019.]

Sažetak
Paper presents development of the acoustic model for Croatian language for automatic speech recognition (ASR). Continuous speech recognition is performed by means of the Hidden Markov Models (HMM) implemented in the HMM Toolkit (HTK). In order to adjust the HTK to the native language a novel algorithm for Croatian language transcription (CLT) has been developed. It is based on phonetic assimilation rules that are applied within uttered words. Phonetic questions for state tying of different triphone models have also been developed. The automated system for training and evaluation of acoustic models has been developed and integrated with the new graphical user interface (GUI). Targeted applications of this ASR system are stress inoculation training (SIT) and virtual reality exposure therapy (VRET). Adaptability of the model to a closed set of speakers is important for such applications and this paper investigates the applicability of the HTK tool for typical scenarios. Robustness of the tool to a new language was tested in matched conditions by a parallel training of an English model that was used as a baseline. Ten native Croatian speakers participated in experiments. Encouraging results were achieved and reported with the developed model for Croatian language.

Ključne riječi
Acoustic model; Automatic speech recognition; Croatian language; Hidden Markov models; Phonetic assimilation; Phonetic transcription algorithm; Recognition accuracy

Hrčak ID: 51368

URI
https://hrcak.srce.hr/51368

[hrvatski]

Posjeta: 1.419 *