Original scientific article
https://doi.org/10.1080/00051144.2024.2371249
Data augmentation using a 1D-CNN model with MFCC/MFMC features for speech emotion recognition
Thomas Mary Little Flower*
Department of ECE, St.Xavier’s Catholic College of Engineering, Chunkankadai, India
Thirasama Jaya
Department of ECE, Saveetha College of Engineering, Chennai, India
Sreedharan Christopher Ezhil Singh
Department of Mechanical Engineering, Vimal Jyothi Engineering College, Kannur, India
* Corresponding author.
Abstract
Speech emotion recognition (SER) is attractive in several domains, such as automated translation,
call centres, intelligent healthcare, and human–computer interaction. Deep learning models for
emotion identification require considerable labelled data, which is not always available in the
SER field. A database needs enough speech samples, good features, and an effective classifier to
identify emotions efficiently. This study uses data augmentation to increase the number of input
voice samples and address the data shortage. The database size is increased by adding white noise
to the speech signals. In this work, Mel-frequency Cepstral Coefficient (MFCC) and Mel-frequency
Magnitude Coefficient (MFMC) features, along with a one-dimensional convolutional neural network
(1D-CNN), are used to classify speech emotions. The datasets used to evaluate the model's
performance were AESDD, CAFE, EmoDB, IEMOCAP, and MESD. The 1D-CNN (MFMC) model with data
augmentation performed best, with an average accuracy of 99.2% for AESDD, 99.5% for CAFE, 97.5%
for EmoDB, 92.4% for IEMOCAP, and 96.9% for the MESD database. The proposed 1D-CNN (MFMC) with
data augmentation outperforms the 1D-CNN (MFCC) without data augmentation in emotion recognition.
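The abstract describes a three-stage pipeline: white-noise data augmentation, MFCC/MFMC feature extraction, and a 1D-CNN classifier. The following is a minimal sketch of such a pipeline, assuming Python with librosa and TensorFlow; the noise factor, MFCC settings, file name, and network layout are illustrative assumptions rather than the paper's configuration, and the MFMC features (which have no librosa implementation) are represented here by MFCC.

```python
# Sketch only: white-noise augmentation + MFCC features + a small 1D-CNN.
# All parameter values and layer sizes below are assumptions for illustration.
import numpy as np
import librosa
import tensorflow as tf

def add_white_noise(signal, noise_factor=0.005):
    """Augment a speech signal by adding scaled Gaussian white noise."""
    noise = np.random.randn(len(signal))
    return signal + noise_factor * noise

def extract_mfcc(signal, sr, n_mfcc=40):
    """Extract MFCCs and average them over time into a single feature vector."""
    mfcc = librosa.feature.mfcc(y=signal, sr=sr, n_mfcc=n_mfcc)
    return mfcc.mean(axis=1)

def build_1d_cnn(input_len, n_classes):
    """A small 1D-CNN over the feature vector (architecture is an assumption)."""
    model = tf.keras.Sequential([
        tf.keras.layers.Input(shape=(input_len, 1)),
        tf.keras.layers.Conv1D(64, 5, activation="relu"),
        tf.keras.layers.MaxPooling1D(2),
        tf.keras.layers.Conv1D(128, 5, activation="relu"),
        tf.keras.layers.GlobalAveragePooling1D(),
        tf.keras.layers.Dense(64, activation="relu"),
        tf.keras.layers.Dense(n_classes, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

# Example: load one utterance (hypothetical file), create an augmented copy,
# and extract features for both the clean and the noisy version.
signal, sr = librosa.load("utterance.wav", sr=16000)
features = np.stack([extract_mfcc(signal, sr),
                     extract_mfcc(add_white_noise(signal), sr)])
features = features[..., np.newaxis]          # shape: (samples, n_mfcc, 1)
model = build_1d_cnn(input_len=features.shape[1], n_classes=7)
```

In this sketch the augmented copies are treated as additional training samples alongside the originals, which is how adding white noise enlarges the database as described in the abstract.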
Keywords
Neural networks; affective computing; emotion recognition; audio database; accuracy
Hrčak ID:
326329
Publication date:
3 July 2024