hrcak mascot   Srce   HID

Izvorni znanstveni članak
https://doi.org/10.20532/cit.2017.1003398

Mix Multiple Features to Evaluate the Content and the Linguistic Quality of Text Summaries

Samira Ellouze ; University of Sfax, Faculty of Economics and Management of Sfax, Sfax, Tunisia
Maher Jaoua ; University of Sfax, Faculty of Economics and Management of Sfax, Sfax, Tunisia
Lamia Hadrich Belguith ; University of Sfax, Faculty of Economics and Management of Sfax, Sfax, Tunisia

Puni tekst: engleski, pdf (460 KB) str. 149-166 preuzimanja: 250* citiraj
APA 6th Edition
Ellouze, S., Jaoua, M. i Hadrich Belguith, L. (2017). Mix Multiple Features to Evaluate the Content and the Linguistic Quality of Text Summaries. Journal of computing and information technology, 25 (2), 149-166. https://doi.org/10.20532/cit.2017.1003398
MLA 8th Edition
Ellouze, Samira, et al. "Mix Multiple Features to Evaluate the Content and the Linguistic Quality of Text Summaries." Journal of computing and information technology, vol. 25, br. 2, 2017, str. 149-166. https://doi.org/10.20532/cit.2017.1003398. Citirano 28.02.2020.
Chicago 17th Edition
Ellouze, Samira, Maher Jaoua i Lamia Hadrich Belguith. "Mix Multiple Features to Evaluate the Content and the Linguistic Quality of Text Summaries." Journal of computing and information technology 25, br. 2 (2017): 149-166. https://doi.org/10.20532/cit.2017.1003398
Harvard
Ellouze, S., Jaoua, M., i Hadrich Belguith, L. (2017). 'Mix Multiple Features to Evaluate the Content and the Linguistic Quality of Text Summaries', Journal of computing and information technology, 25(2), str. 149-166. https://doi.org/10.20532/cit.2017.1003398
Vancouver
Ellouze S, Jaoua M, Hadrich Belguith L. Mix Multiple Features to Evaluate the Content and the Linguistic Quality of Text Summaries. Journal of computing and information technology [Internet]. 2017 [pristupljeno 28.02.2020.];25(2):149-166. https://doi.org/10.20532/cit.2017.1003398
IEEE
S. Ellouze, M. Jaoua i L. Hadrich Belguith, "Mix Multiple Features to Evaluate the Content and the Linguistic Quality of Text Summaries", Journal of computing and information technology, vol.25, br. 2, str. 149-166, 2017. [Online]. https://doi.org/10.20532/cit.2017.1003398

Sažetak
In this article, we propose a method of text summary's content and linguistic quality evaluation that is based on a machine learning approach. This method operates by combining multiple features to build predictive models that evaluate the content and the linguistic quality of new summaries (unseen) constructed from the same source documents as the summaries used in the training and the validation of models. To obtain the best model, many single and ensemble learning classifiers are tested. Using the constructed models, we have achieved a good performance in predicting the content and the linguistic quality scores. In order to evaluate the summarization systems, we calculated the system score as the average of the score of summaries that are built from the same system. Then, we evaluated the correlation of the system score with the manual system score. The obtained correlation indicates that the system score outperforms the baseline scores.

Ključne riječi
text summary; summary evaluation; content; linguistic quality; machine learning

Hrčak ID: 183330

URI
https://hrcak.srce.hr/183330

Posjeta: 373 *