Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning

Aljuhani, Khulood O.; Alyoubi, Khaled H.; Alotaibi, Fahd S.

doi:10.31803/tg-20220305120018

Technical Journal, Vol. 16 No. 3, 2022.

Review article

https://doi.org/10.31803/tg-20220305120018

Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning

Khulood O. Aljuhani orcid.org/0000-0002-3975-7559 ; Information Systems Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
Khaled H. Alyoubi ; Information Systems Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
Fahd S. Alotaibi ; Information Systems Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia

Full text: english pdf 944 Kb

page 394-400

downloads: 1.011

cite

APA 6th Edition

Aljuhani, K.O., Alyoubi, K.H. & Alotaibi, F.S. (2022). Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning. Tehnički glasnik, 16 (3), 394-400. https://doi.org/10.31803/tg-20220305120018

MLA 8th Edition

Aljuhani, Khulood O., et al. "Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning." Tehnički glasnik, vol. 16, no. 3, 2022, pp. 394-400. https://doi.org/10.31803/tg-20220305120018. Accessed 23 Jul. 2026.

Chicago 17th Edition

Aljuhani, Khulood O., Khaled H. Alyoubi and Fahd S. Alotaibi. "Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning." Tehnički glasnik 16, no. 3 (2022): 394-400. https://doi.org/10.31803/tg-20220305120018

Harvard

Aljuhani, K.O., Alyoubi, K.H., and Alotaibi, F.S. (2022). 'Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning', Tehnički glasnik, 16(3), pp. 394-400. https://doi.org/10.31803/tg-20220305120018

Vancouver

Aljuhani KO, Alyoubi KH, Alotaibi FS. Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning. Tehnički glasnik [Internet]. 2022 [cited 2026 July 23];16(3):394-400. https://doi.org/10.31803/tg-20220305120018

IEEE

K.O. Aljuhani, K.H. Alyoubi and F.S. Alotaibi, "Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning", Tehnički glasnik, vol.16, no. 3, pp. 394-400, 2022. [Online]. https://doi.org/10.31803/tg-20220305120018

Abstract

In recent years, social media networks are emerging as a key player by providing platforms for opinions expression, communication, and content distribution. However, users often take advantage of perceived anonymity on social media platforms to share offensive or hateful content. Thus, offensive language has grown as a significant issue with the increase in online communication and the popularity of social media platforms. This problem has attracted significant attention for devising methods for detecting offensive content and preventing its spread on online social networks. Therefore, this paper aims to develop an effective Arabic offensive language detection model by employing deep learning and semantic and contextual features. This paper proposes a deep learning approach that utilizes the bidirectional long short-term memory (BiLSTM) model and domain-specific word embeddings extracted from an Arabic offensive dataset. The detection approach was evaluated on an Arabic dataset collected from Twitter. The results showed the highest performance accuracy of 0.93% with the BiLSTM model trained using a combination of domain-specific and agnostic-domain word embeddings.

Keywords

Arabic Natural Language Processing; Arabic Tweets; Offensive Language Detection; Offensive Language; Word Embeddings

Hrčak ID:

279413

URI

https://hrcak.srce.hr/279413

Publication date:

21.6.2022.

Visits: 2.459 *

Login and registration

Technical Journal, Vol. 16 No. 3, 2022.

Abstract

Keywords

Hrčak ID:

URI

Publication date: