Skoči na glavni sadržaj

Izvorni znanstveni članak

https://doi.org/10.32985/ijeces.16.1.7

Augmented Language Dataset for Enhanced Personality Profiling

Mohmad Azhar Teli ; Department of Computer Science; University of Kashmir, Hazratbal Srinagar, Srinagar 190006, India *
Manzoor Ahmad Chachoo ; Department of Computer Science; University of Kashmir, Hazratbal Srinagar, Srinagar 190006, India

* Dopisni autor.


Puni tekst: engleski pdf 678 Kb

str. 65-74

preuzimanja: 0

citiraj


Sažetak

The lexical hypothesis asserts that language encompasses all meaningful individual differences in personality. Language is a vital tool for communication and self-expression, making it essential for understanding and assessing human personality. This paper investigates personality recognition from language use, emphasizing the significance of language in capturing and analyzing personality traits. A comprehensive literature review examines various approaches and techniques in personality recognition. We investigate the effectiveness of language use in predicting personality traits, employing multiple feature extraction and data augmentation techniques to enhance the accuracy and robustness of the personality recognition models. Our approach involves training a generative model, PersonaG, on the Essays dataset, subsequently using it to generate augmented data (AUG-Essays). We compare the performance of machine learning classifiers using LIWC, TF-IDF, Glove, and Word-Vec features on both Essays and AUG-Essays datasets. Our findings demonstrate significant improvements in predictive performance, offering valuable insights for applications in human resources, marketing, and beyond.

Ključne riječi

Personality; Social Signal Processing; Natural Language Processing;

Hrčak ID:

326076

URI

https://hrcak.srce.hr/326076

Datum izdavanja:

2.1.2025.

Posjeta: 0 *