Original scientific paper
https://doi.org/10.17535/crorr.2025.0014
Going concern prediction – A horse race between traditional and regularization machine learning models
Tina Vuko
orcid.org/0000-0002-7030-2130
; University of Split, Faculty of Economics, Business and Tourism, Department of Quantitative Methods, Cvite Fiskovića 5, 21000 Split, Croatia
*
Slavko Šodan
orcid.org/0000-0002-5043-6801
; University of Split, Faculty of Economics, Business and Tourism, Department of Quantitative Methods, Cvite Fiskovića 5, 21000 Split, Croatia
Ivana Perica
orcid.org/0000-0001-8395-5096
; University of Split, Faculty of Economics, Business and Tourism, Department of Quantitative Methods, Cvite Fiskovića 5, 21000 Split, Croatia
* Corresponding author.
Abstract
Regularization machine learning (ML) methods are increasingly applied in accounting research, offering new possibilities in predictive modeling. Their strength lies in regularization, which addresses the main threat to generalization: the risk of overfitting the training data. While these methods are known to outperform traditional regression approaches on large and balanced datasets, this may not hold for small and imbalanced datasets. Model validation is also challenging in such settings because traditional performance measures, such as prediction accuracy, can be misleading. We address this problem by comparing two traditional and five regularization-based methods in predicting going concern uncertainty (GCU) on a sample of listed companies in Croatia. Because of class imbalance, we evaluate the models cautiously, using several classification performance measures as well as calibration of the models to account for their uncertainty. As expected, no model performs best across all evaluation criteria, but regularization methods are better calibrated. Given our results, we suggest that model selection should consider model calibration, a combination of different performance metrics, and, if feasible, the economic impact of the model's statistical performance.
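The point that prediction accuracy can mislead under class imbalance can be illustrated with a minimal sketch. The numbers are purely hypothetical and are not taken from the paper's sample: we assume 100 firms, of which 10 carry a GCU, and a trivial classifier that never flags GCU. The helper names (`accuracy`, `recall`, `balanced_accuracy`) are our own and are written in plain Python for illustration only.

```python
# Hypothetical data (not from the paper): 100 firms, 10 of them with GCU (label 1).
y_true = [1] * 10 + [0] * 90
# A trivial classifier that never predicts going concern uncertainty.
y_pred = [0] * 100


def accuracy(y_true, y_pred):
    """Share of all firms classified correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def recall(y_true, y_pred, label):
    """Share of firms with the given true label that were classified correctly."""
    hits = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
    total = sum(1 for t in y_true if t == label)
    return hits / total


def balanced_accuracy(y_true, y_pred):
    """Mean of the recall on the GCU class and the recall on the non-GCU class."""
    return (recall(y_true, y_pred, 1) + recall(y_true, y_pred, 0)) / 2


acc = accuracy(y_true, y_pred)               # high, despite missing every GCU firm
sensitivity = recall(y_true, y_pred, 1)      # share of GCU firms detected
bal_acc = balanced_accuracy(y_true, y_pred)  # no better than chance

print(f"accuracy={acc:.2f}, sensitivity={sensitivity:.2f}, balanced accuracy={bal_acc:.2f}")
```

In this toy setting the classifier reaches 90% accuracy while detecting none of the GCU firms, which is why a combination of metrics is more informative than accuracy alone.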
Keywords
elastic net; lasso; ridge regression; class imbalance; prediction
Hrčak ID:
329783
Publication date:
2.4.2025.