Skoči na glavni sadržaj

Izvorni znanstveni članak

https://doi.org/10.17559/TV-20250620002763

Intelligent Detection of Road Rage Using PNN Parameter Optimization and Multimodal Driver Speech and Text Data

Enlin Xie ; Xiangsihu College of Guangxi Minzu University, Nanning 530000, P. R. China *
Yiliu Huang ; Teachers Training Center of Guangxi Zhuang Autonomous Region Nanning, 530018, China

* Dopisni autor.


Puni tekst: engleski pdf 474 Kb

str. 684-696

preuzimanja: 74

citiraj


Sažetak

Road rage is a critical factor in traffic accidents, often expressed through vocal and semantic cues. This study proposes a multimodal road rage detection system that integrates speech and text features. Speech signals are processed by a probabilistic neural network (PNN) optimized with an Improved Sand Cat Swarm Optimization (ISCSO) algorithm, while text features are modelled with a long short-term memory (LSTM) network. A decision-level fusion strategy combines outputs from both modalities. Experiments on a self-constructed dataset of 10,000 speech samples and corresponding text corpora demonstrate that the proposed model achieves superior performance compared to CNN, DBN, TextCNN, and hybrid deep learning baselines. The system achieved maximum accuracy and recall of 95.61% and 99.31%, while maintaining low computational overhead (minimum detection time 58 ms, memory usage 10.06%). These findings suggest that the ISCSO-PNN and LSTM multimodal fusion framework provides an efficient and effective approach to detecting road rage, with strong potential for integration into real-time intelligent transportation systems.

Ključne riječi

ISCSO; parameter optimization; PNN; road rage detection; speech; text

Hrčak ID:

344994

URI

https://hrcak.srce.hr/344994

Datum izdavanja:

28.2.2026.

Posjeta: 188 *