Izvorni znanstveni članak
https://doi.org/10.24138/jcomss-2024-0121
Comparative Analysis of SMOTE and ROSE Oversampling Techniques for kNN-Based Autonomous Vehicle Behavior Modeling
Celine Serbouh Touazi
; Conservatoire National des Arts et Metiers, France
*
Iness Ahriz
; Conservatoire National des Arts et Metiers, France
Ndeye Niang
; Conservatoire National des Arts et Metiers, France
Alain Piperno
; Union technique de l’automobile, du motocycle et du cycle (UTAC), France
* Dopisni autor.
Sažetak
In this paper, we present a comparative analysis of Synthetic Minority Oversampling TEchnique (SMOTE) and Random OverSampling Examples (ROSE) oversampling techniques for K Nearest Neigbhors KNN-based autonomous vehicle behavior modeling. We address the challenges posed by imbalanced and mixed datasets in the context of autonomous vehicle testing, where the majority of test outcomes are classified as ”OK” (safe) and fewer as ”KO” (unsafe). We propose an enhanced approach that extends our previous work by incorporating ROSE as an alternative to SMOTE for generating synthetic samples. We integrate these resampling techniques with Leave- One-Out Cross-Validation (LOO-CV), applying resampling at each iteration to ensure data balancing is tailored to each training set. Additionally, we investigate the impact of different encoding strategies for categorical variables, including OneHot, binary encoding, and Factor Analysis of Mixed Data (FAMD). Our research aims to develop a robust classification model capable of accurately predicting autonomous vehicle behavior while effectively managing class imbalance and mixed data types, despite the limited availability of data due to costly and timeconsuming testing procedures.
Ključne riječi
IA; Autonomous Vehicles; SMOTE; ROSE
Hrčak ID:
330889
URI
Datum izdavanja:
30.6.2025.
Posjeta: 83 *