A rule based prosody model for Turkish text-to-speech synthesis

Uslu, Ibrahim Baran; Ilk, Hakki Gokhan; Yilmaz, Asim Egemen

Technical gazette, Vol. 20 No. 2, 2013.

Original scientific paper

Ibrahim Baran Uslu ; Atilim University, Faculty of Engineering, Electrical-Electronics Eng. Dept., Kizilcasar Mahallesi 06836 Incek Ankara, Turkey
Hakki Gokhan Ilk ; Ankara University, Faculty of Engineering, Electrical-Electronics Eng. Dept., 06100 Tandogan Ankara, Turkey
Asim Egemen Yilmaz ; Ankara University, Faculty of Engineering, Electrical-Electronics Eng. Dept., 06100 Tandogan Ankara, Turkey

Full text: croatian pdf 879 Kb

page 217-223

downloads: 357

cite

APA 6th Edition

Uslu, I.B., Ilk, H.G. & Yilmaz, A.E. (2013). A rule based prosody model for Turkish text-to-speech synthesis. Tehnički vjesnik, 20 (2), 217-223. Retrieved from https://hrcak.srce.hr/100155

MLA 8th Edition

Uslu, Ibrahim Baran, et al. "A rule based prosody model for Turkish text-to-speech synthesis." Tehnički vjesnik, vol. 20, no. 2, 2013, pp. 217-223. https://hrcak.srce.hr/100155. Accessed 22 Dec. 2024.

Chicago 17th Edition

Uslu, Ibrahim Baran, Hakki Gokhan Ilk and Asim Egemen Yilmaz. "A rule based prosody model for Turkish text-to-speech synthesis." Tehnički vjesnik 20, no. 2 (2013): 217-223. https://hrcak.srce.hr/100155

Harvard

Uslu, I.B., Ilk, H.G., and Yilmaz, A.E. (2013). 'A rule based prosody model for Turkish text-to-speech synthesis', Tehnički vjesnik, 20(2), pp. 217-223. Available at: https://hrcak.srce.hr/100155 (Accessed 22 December 2024)

Vancouver

Uslu IB, Ilk HG, Yilmaz AE. A rule based prosody model for Turkish text-to-speech synthesis. Tehnički vjesnik [Internet]. 2013 [cited 2024 December 22];20(2):217-223. Available from: https://hrcak.srce.hr/100155

IEEE

I.B. Uslu, H.G. Ilk and A.E. Yilmaz, "A rule based prosody model for Turkish text-to-speech synthesis", Tehnički vjesnik, vol.20, no. 2, pp. 217-223, 2013. [Online]. Available: https://hrcak.srce.hr/100155. [Accessed: 22 December 2024]

Full text: english pdf 879 Kb

page 217-223

downloads: 940

Abstract

This paper presents a novel prosody model for a Turkish text-to-speech (TTS) synthesis system. After developing a TTS system driven by parametric features consisting of duration, pitch and energy modifications, we derive prosody rules to increase the naturalness of the synthesizer. Since an inflected verb in Turkish can form a stand-alone sentence through the suffixes it takes, we build a perceptual prosody model by defining rules on the stress patterns of verb inflections. The affirmative, negative and interrogative (both positive and negative) forms of many verbs were examined systematically, and some phrases were examined in the same way to obtain proper prosody. According to listening tests, the rules defined on duration, pitch and energy modification weights yield perceptually better synthesized speech: an average improvement of about 1.78/5.0 in the CMOS (Comparative Mean Opinion Score) test. This improvement indicates the success of the proposed prosody model.

Keywords

CMOS test; diphone; natural speech; prosody; PSOLA; text-to-speech synthesis (TTS); verb inflection

Hrčak ID:

100155

URI

https://hrcak.srce.hr/100155

Publication date:

15.4.2013.
