Original scientific paper

https://doi.org/10.2498/cit.1002234

Semantic and Contextual Knowledge Representation for Lexical Disambiguation: Case of Arabic-French Query Translation

Souheyl Mallat ; Department of Computer Sciences, University of Monastir, Tunisia, LATICE Laboratory Research
Mohamed Achraf Ben Mohamed ; Department of Computer Sciences, University of Monastir, Tunisia, LATICE Laboratory Research
Emna Hkiri ; Department of Computer Sciences, University of Monastir, Tunisia, LATICE Laboratory Research
Anis Zouaghi ; Department of Computer Sciences, Higher Institute of Applied Science and Technologies Sousse, Tunisia, LATICE Laboratory Research
Mounir Zrigui ; Department of Computer Sciences, University of Monastir, Tunisia, LATICE Laboratory Research


page 191-215


Abstract

This paper presents an automatic query translation system for Arabic-French cross-language information retrieval. For lexical disambiguation, the system combines two resources: a bilingual dictionary and a parallel corpus. To select the best translation, the method relies on a correspondence measure between two semantic networks. The first represents the senses of the ambiguous query terms. The second is a contextually enriched semantic network representing the collection of sentences that respond to the query. This collection forms the knowledge base of the disambiguation method and is obtained by alignment with the relevant Arabic sentences. The evaluation shows that contextual enrichment improves translation quality. The system achieves high precision, roughly proportional to the precision of the alignment used. Finally, a comparison of BLEU scores with Google Translate demonstrates the system's potential.
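The selection step can be illustrated with a minimal sketch. The abstract does not specify the correspondence measure, so the code below assumes a plain Jaccard overlap between sets of concept labels as a stand-in, and it flattens each semantic network to a set of nodes. The function names `jaccard` and `pick_translation` and the toy French data are hypothetical and not taken from the paper.

```python
def jaccard(a, b):
    """Jaccard overlap between two collections of concept labels."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def pick_translation(candidate_senses, context_concepts):
    """Return the candidate translation whose sense concepts best
    overlap the concepts drawn from the context.

    candidate_senses: dict mapping each candidate translation to the
        concept labels of its sense (assumed stand-in for a sense network).
    context_concepts: concept labels from the aligned target-language
        sentences (assumed stand-in for the enriched network).
    """
    return max(
        candidate_senses,
        key=lambda t: jaccard(candidate_senses[t], context_concepts),
    )


# Hypothetical toy data: an ambiguous Arabic term with two French
# candidates from a bilingual dictionary ("oeil" = eye, "source" = spring).
candidates = {
    "oeil": {"vision", "visage", "regard"},
    "source": {"eau", "riviere", "origine"},
}
# Concepts assumed to come from French sentences aligned with the
# relevant Arabic sentences.
context = {"eau", "montagne", "riviere"}

best = pick_translation(candidates, context)
print(best)
```

With this toy data the context shares two concepts with "source" and none with "oeil", so "source" is selected. A graph-based measure over the actual network relations would replace `jaccard` in a fuller treatment.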

Keywords

cross-language information retrieval systems; machine translation; lexical disambiguation; semantic and conceptual indexing; contextual relations; matching; automatic evaluation metrics

Hrčak ID:

129194

URI

https://hrcak.srce.hr/129194

Publication date:

31.10.2014.
