Izvorni znanstveni članak
https://doi.org/10.32985/ijeces.16.1.7
Augmented Language Dataset for Enhanced Personality Profiling
Mohmad Azhar Teli
; Department of Computer Science; University of Kashmir, Hazratbal Srinagar, Srinagar 190006, India
*
Manzoor Ahmad Chachoo
; Department of Computer Science; University of Kashmir, Hazratbal Srinagar, Srinagar 190006, India
* Dopisni autor.
Sažetak
The lexical hypothesis asserts that language encompasses all meaningful individual differences in personality. Language is a vital tool for communication and self-expression, making it essential for understanding and assessing human personality. This paper investigates personality recognition from language use, emphasizing the significance of language in capturing and analyzing personality traits. A comprehensive literature review examines various approaches and techniques in personality recognition. We investigate the effectiveness of language use in predicting personality traits, employing multiple feature extraction and data augmentation techniques to enhance the accuracy and robustness of the personality recognition models. Our approach involves training a generative model, PersonaG, on the Essays dataset, subsequently using it to generate augmented data (AUG-Essays). We compare the performance of machine learning classifiers using LIWC, TF-IDF, Glove, and Word-Vec features on both Essays and AUG-Essays datasets. Our findings demonstrate significant improvements in predictive performance, offering valuable insights for applications in human resources, marketing, and beyond.
Ključne riječi
Personality; Social Signal Processing; Natural Language Processing;
Hrčak ID:
326076
URI
Datum izdavanja:
2.1.2025.
Posjeta: 0 *