Technical gazette, Vol. 30 No. 1, 2023.
Original scientific paper
https://doi.org/10.17559/TV-20220623113041
A Text Mining and Ensemble Learning Based Approach for Credit Risk Prediction
Yang Mao
; School of Economics and Management, Beijing Jiaotong University, Haidian, 100044, China
Shifeng Liu
; School of Economics and Management, Beijing Jiaotong University, Haidian, 100044, China
Daqing Gong
; School of Economics and Management, Beijing Jiaotong University, Haidian, 100044, China
Abstract
Traditional credit risk prediction models mainly rely on financial data. However, technological innovation is the main driving force for the development of enterprises in strategic emerging industries, which is closely related to enterprise credit risk. In this paper, a novel prediction framework utilizing technological innovation text mining data and ensemble learning is proposed. The empirical data from China listed enterprises in strategic emerging industries were applied to construct prediction models using the classification and regression tree model, the random forest model and extreme gradient boosting model. The results show that the model uses the technological innovation text mining data proven to have significant predict ability, and top management teamꞌs attention to innovation variables offer the best prediction capacities. This work improves the application value of enterprise credit risk prediction models in strategic emerging industries by embedding the mining of technological innovation text information.
Keywords
credit risks; ensemble learning; strategic emerging industries; text mining
Hrčak ID:
288405
URI
Publication date:
15.12.2022.
Visits: 1.387 *