Technical gazette, Vol. 30 No. 6, 2023.
Original scientific paper
https://doi.org/10.17559/TV-20230316000447
A Hybrid Technological Innovation Text Mining, Ensemble Learning and Risk Scorecard Approach for Enterprise Credit Risk Assessment
Yang Mao; School of Economics and Management, Beijing Jiaotong University, Haidian, 100044, China
Shifeng Liu; School of Economics and Management, Beijing Jiaotong University, Haidian, 100044, China
Daqing Gong; School of Economics and Management, Beijing Jiaotong University, Haidian, 100044, China
*
* Corresponding author.
Abstract
Enterprise credit risk assessment models typically use financial information as predictor variables, and therefore rely on backward-looking historical information rather than forward-looking information. We propose a novel hybrid credit risk assessment approach that uses technological innovation information as predictor variables. Text mining techniques are used to extract this information for each enterprise. A combination of random forest and extreme gradient boosting is used for indicator screening, and a risk scorecard based on logistic regression is then used for credit risk scoring. Our results show that technological innovation indicators obtained through text mining provide valuable information for credit risk assessment, and that combining ensemble learning (random forest and extreme gradient boosting) with logistic regression outperforms other traditional methods. The best result achieved an area under the receiver operating characteristic curve (AUC) of 0.9129. In addition, our approach provides meaningful scoring rules for the credit risk assessment of technology innovation enterprises.
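To illustrate the final scoring step, the sketch below maps a default probability, such as one produced by a logistic regression model, onto a scorecard scale using the common "points to double the odds" (PDO) convention. The abstract does not state which scaling the authors use, so the PDO convention itself, the base score of 600, the base odds of 50:1, the PDO of 20, and the function names are all illustrative assumptions rather than the paper's settings.

```python
import math


def scorecard_params(base_score=600.0, base_odds=50.0, pdo=20.0):
    """Return (factor, offset) for PDO scaling.

    All three defaults are illustrative assumptions, not values from the paper.
    base_odds is the good:bad odds that should map to base_score.
    """
    factor = pdo / math.log(2)
    offset = base_score - factor * math.log(base_odds)
    return factor, offset


def credit_score(prob_default, factor, offset):
    """Map a predicted default probability to a score (higher means safer)."""
    if not 0.0 < prob_default < 1.0:
        raise ValueError("prob_default must lie strictly between 0 and 1")
    odds_good = (1.0 - prob_default) / prob_default
    return offset + factor * math.log(odds_good)


factor, offset = scorecard_params()

# An enterprise at the base odds (50:1) and one at twice those odds (100:1).
score_base = credit_score(1.0 / 51.0, factor, offset)
score_doubled = credit_score(1.0 / 101.0, factor, offset)
print(round(score_base, 2), round(score_doubled, 2))
```

Because the score is linear in the log-odds, and a logistic regression model is linear in its inputs on the log-odds scale, the total can be split into per-indicator point contributions. That property is what makes scorecard rules readable.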
Keywords
ensemble learning; risk assessment; risk scorecard; technological innovation; text mining
Hrčak ID: 309218
Publication date: 25.10.2023.