hrcak mascot   Srce   HID

Original scientific paper
https://doi.org/10.2498/cit.1001917

Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?

Nikola Ljubešić ; Department of Information Sciences, Faculty of Humanities and Social Sciences, University of Zagreb, Croatia
Petra Bago ; Department of Information Sciences, Faculty of Humanities and Social Sciences, University of Zagreb, Croatia
Damir Boras ; Department of Information Sciences, Faculty of Humanities and Social Sciences, University of Zagreb, Croatia

Fulltext: english, pdf (165 KB) pages 303-308 downloads: 270* cite
APA 6th Edition
Ljubešić, N., Bago, P. & Boras, D. (2010). Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?. Journal of computing and information technology, 18 (4), 303-308. https://doi.org/10.2498/cit.1001917
MLA 8th Edition
Ljubešić, Nikola, et al. "Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?." Journal of computing and information technology, vol. 18, no. 4, 2010, pp. 303-308. https://doi.org/10.2498/cit.1001917. Accessed 24 Jul. 2019.
Chicago 17th Edition
Ljubešić, Nikola, Petra Bago and Damir Boras. "Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?." Journal of computing and information technology 18, no. 4 (2010): 303-308. https://doi.org/10.2498/cit.1001917
Harvard
Ljubešić, N., Bago, P., and Boras, D. (2010). 'Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?', Journal of computing and information technology, 18(4), pp. 303-308. https://doi.org/10.2498/cit.1001917
Vancouver
Ljubešić N, Bago P, Boras D. Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?. Journal of computing and information technology [Internet]. 2010 [cited 2019 July 24];18(4):303-308. https://doi.org/10.2498/cit.1001917
IEEE
N. Ljubešić, P. Bago and D. Boras, "Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?", Journal of computing and information technology, vol.18, no. 4, pp. 303-308, 2010. [Online]. https://doi.org/10.2498/cit.1001917

Abstracts
This research is the first step towards developing a system
for translating Croatian weather forecasts into multiple
languages. This step deals with the Croatian-English
language pair. The parallel corpus consists of a one-year
sample of the weather forecasts for the Adriatic, consisting
of 7,893 sentence pairs. Evaluation is performed
by the automatic evaluation measures BLUE, NIST and
METEOR, as well as by manually evaluating a sample of
200 translations. We have shown that with a small-sized
training set and the state-of-the artMoses system, decoding
can be done with 96% accuracy concerning adequacy
and fluency. Additional improvement is expected by
increasing the training set size. Finally, the correlation
of the recorded evaluation measures is explored.

Keywords
statistical machine translation; automatic evaluation; manual evaluation; correlation between evaluation measures

Hrčak ID: 63862

URI
https://hrcak.srce.hr/63862

Visits: 432 *