Reordering of Source Side for a Factored English to Manipuri SMT System

Maibam, Indika; Purkayastha, Bipul Syam

doi:10.32985/ijeces.14.3.6

International journal of electrical and computer engineering systems, Vol. 14 No. 3, 2023.

Original scientific paper

https://doi.org/10.32985/ijeces.14.3.6

Indika Maibam orcid.org/0000-0001-7695-9929 ; Department of Computer Science, Indira Gandhi National Tribal University, Kangpokpi, Imphal, Manipur, India
Bipul Syam Purkayastha ; Department of Computer Science, Assam University, Silchar, Assam, India

Full text: English, PDF, 404 KB

pp. 285-292

Downloads: 443

Cite

APA 6th Edition

Maibam, I. & Purkayastha, B.S. (2023). Reordering of Source Side for a Factored English to Manipuri SMT System. International journal of electrical and computer engineering systems, 14 (3), 285-292. https://doi.org/10.32985/ijeces.14.3.6

MLA 8th Edition

Maibam, Indika and Bipul Syam Purkayastha. "Reordering of Source Side for a Factored English to Manipuri SMT System." International journal of electrical and computer engineering systems, vol. 14, no. 3, 2023, pp. 285-292. https://doi.org/10.32985/ijeces.14.3.6. Accessed 30 May 2026.

Chicago 17th Edition

Maibam, Indika and Bipul Syam Purkayastha. "Reordering of Source Side for a Factored English to Manipuri SMT System." International journal of electrical and computer engineering systems 14, no. 3 (2023): 285-292. https://doi.org/10.32985/ijeces.14.3.6

Harvard

Maibam, I. and Purkayastha, B.S. (2023). 'Reordering of Source Side for a Factored English to Manipuri SMT System', International journal of electrical and computer engineering systems, 14(3), pp. 285-292. https://doi.org/10.32985/ijeces.14.3.6

Vancouver

Maibam I, Purkayastha BS. Reordering of Source Side for a Factored English to Manipuri SMT System. International journal of electrical and computer engineering systems [Internet]. 2023 [cited 30 May 2026];14(3):285-292. https://doi.org/10.32985/ijeces.14.3.6

IEEE

I. Maibam and B.S. Purkayastha, "Reordering of Source Side for a Factored English to Manipuri SMT System", International journal of electrical and computer engineering systems, vol. 14, no. 3, pp. 285-292, 2023. [Online]. https://doi.org/10.32985/ijeces.14.3.6

Abstract

Translation between similar languages with massive parallel corpora is readily handled by large-scale systems using either Statistical Machine Translation (SMT) or Neural Machine Translation (NMT). Translation involving low-resource language pairs with linguistic divergence has always been a challenge. We consider one such pair, English-Manipuri, which shows linguistic divergence and belongs to the low-resource category. For such language pairs, SMT is generally preferred over NMT. However, SMT’s more prominent phrase-based model translates groupings of surface word forms that it treats as phrases. Therefore, without any linguistic knowledge, it fails to learn a proper mapping between the source and target language symbols. Our model adopts a factored model of SMT (FSMT3*) with a part-of-speech (POS) tag as a factor to incorporate linguistic information about the languages, followed by hand-coded reordering. Reordering the source sentences makes them more similar to the target language, allowing better mapping between source and target symbols. The reordering also converts long-distance reordering problems into monotone reordering, which SMT models handle better, thereby reducing the load at decoding time. Additionally, we find that adding POS feature data enhances the system’s precision. Experimental results using automatic evaluation metrics show that our model improves over phrase-based and other factored models that use the lexicalised Moses reordering options. Our FSMT3* model increases the automatic scores of the translation results over the factored model with lexicalised phrase reordering (FSMT2) by 11.05% (Bilingual Evaluation Understudy, BLEU), 5.46% (F1), 9.35% (Precision), and 2.56% (Recall), respectively.
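
The abstract does not list the hand-coded rules themselves, so the following is only an illustrative sketch of what source-side reordering over POS-factored tokens could look like. It assumes Penn Treebank style tags and a single hypothetical rule (move the first verb group to the end of the sentence, since Manipuri is verb-final); the function names `reorder_svo_to_sov` and `to_factored` are invented for this example and are not taken from the paper. The output uses the `word|POS` notation that Moses expects for factored input.

```python
from typing import List, Tuple

Token = Tuple[str, str]  # (surface word, POS tag)

# Assumed tag sets (Penn Treebank style); the paper's actual tagset may differ.
VERB_TAGS = {"VB", "VBD", "VBG", "VBN", "VBP", "VBZ", "MD"}
PUNCT_TAGS = {".", ",", ":"}


def reorder_svo_to_sov(tokens: List[Token]) -> List[Token]:
    """Hypothetical rule: move the first contiguous verb group to the
    end of the sentence, keeping any trailing punctuation last."""
    # Separate trailing punctuation from the sentence body.
    end = len(tokens)
    while end > 0 and tokens[end - 1][1] in PUNCT_TAGS:
        end -= 1
    body, tail = tokens[:end], tokens[end:]

    # Locate the first verb-tagged token, if there is one.
    start = next((i for i, (_, tag) in enumerate(body) if tag in VERB_TAGS), None)
    if start is None:
        return list(tokens)  # nothing to reorder

    # Extend to the end of the contiguous run of verb tags.
    stop = start
    while stop < len(body) and body[stop][1] in VERB_TAGS:
        stop += 1

    verbs = body[start:stop]
    return body[:start] + body[stop:] + verbs + tail


def to_factored(tokens: List[Token]) -> str:
    """Render tokens as word|POS factors separated by spaces."""
    return " ".join(f"{word}|{tag}" for word, tag in tokens)


sentence = [("she", "PRP"), ("reads", "VBZ"), ("a", "DT"), ("book", "NN"), (".", ".")]
print(to_factored(reorder_svo_to_sov(sentence)))
# she|PRP a|DT book|NN reads|VBZ .|.
```

After such a rule is applied, the English side already follows subject-object-verb order, so the decoder can translate it largely left to right instead of searching for a long-distance verb movement. A realistic rule set would need to handle clauses, auxiliaries, and postpositions, which this sketch deliberately ignores.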

Keywords

factored SMT; reordering; factoring; English; Manipuri; automatic evaluation

Hrčak ID:

296696

URI

https://hrcak.srce.hr/296696

Publication date:

28 March 2023

Visits: 1,127
