Skoči na glavni sadržaj

Izvorni znanstveni članak

https://doi.org/10.31341/jios.44.1.2

Machine Translation System for the Industry Domain and Croatian Language

Ivan Dunđer ; Faculty of Humanities and Social Sciences, University of Zagreb, Zagreb, Croatia


Puni tekst: engleski pdf 819 Kb

str. 33-50

preuzimanja: 626

citiraj


Sažetak

Machine translation is increasingly becoming a hot research topic in information and communication sciences, computer science and computational linguistics, due to the fact that it enables communication and transferring of meaning across different languages. As the Croatian language can be considered low-resourced in terms of available services and technology, development of new domain-specific machine translation systems is important, especially due to raised interest and needs of industry, academia and everyday users. Machine translation is not perfect, but it is crucial to assure acceptable quality, which is purpose-dependent. In this research, different statistical machine translation systems were built – but one system utilized domain adaptation in particular, with the intention of boosting the output of machine translation. Afterwards, extensive evaluation has been performed – in form of applying several automatic quality metrics and human evaluation with focus on various aspects. Evaluation is done in order to assess the quality of specific machine-translated text.

Ključne riječi

statistical machine translation; domain adaptation; automatic quality metrics; human quality evaluation; error classification; Croatian language; information and communication sciences

Hrčak ID:

239780

URI

https://hrcak.srce.hr/239780

Datum izdavanja:

25.6.2020.

Posjeta: 1.473 *