Skoči na glavni sadržaj

Izvorni znanstveni članak

Text-to-Speech Synthesis: A Complete System for the Slovenian Language

Jerneja Gros ; Faculty tor Electrical Engineering, University of Ljubljana, Ljubljana, Slovenia
Nikola Pavešić ; Faculty tor Electrical Engineering, University of Ljubljana, Ljubljana, Slovenia
France Mihelič ; Faculty tor Electrical Engineering, University of Ljubljana, Ljubljana, Slovenia


Puni tekst: engleski pdf 4.714 Kb

str. 11-19

preuzimanja: 192

citiraj


Sažetak

A text-to-speech system, capable of synthesising continuous Slovenian speech from an arbitrary input text is described. The text-to-speech system is based on the concatenation of basic speech units, diphones, using the TD-PSOLA technique, and no special hardware is required. The input text is transformed into its spoken equivalent by a series of the modules. The modules, constituting the text-to-speech system are described in detail. Special attention is paid to segmental duration determination, where the effect of speaking rate on phone duration is widely studied. Finally, the results of output speech quality assessment are given in terms of acceptability and intelligibility.

Ključne riječi

text-to-speech synthesis; diphone concatenation; prosody modelling; grapheme-to-phoneme conversion; Slovenian language

Hrčak ID:

150269

URI

https://hrcak.srce.hr/150269

Datum izdavanja:

30.3.1997.

Posjeta: 581 *