Skip to the main content

Original scientific paper

https://doi.org/10.17559/TV-20220623113041

A Text Mining and Ensemble Learning Based Approach for Credit Risk Prediction

Yang Mao ; School of Economics and Management, Beijing Jiaotong University, Haidian, 100044, China
Shifeng Liu ; School of Economics and Management, Beijing Jiaotong University, Haidian, 100044, China
Daqing Gong ; School of Economics and Management, Beijing Jiaotong University, Haidian, 100044, China


Full text: english pdf 977 Kb

page 138-147

downloads: 479

cite


Abstract

Traditional credit risk prediction models mainly rely on financial data. However, technological innovation is the main driving force for the development of enterprises in strategic emerging industries, which is closely related to enterprise credit risk. In this paper, a novel prediction framework utilizing technological innovation text mining data and ensemble learning is proposed. The empirical data from China listed enterprises in strategic emerging industries were applied to construct prediction models using the classification and regression tree model, the random forest model and extreme gradient boosting model. The results show that the model uses the technological innovation text mining data proven to have significant predict ability, and top management teamꞌs attention to innovation variables offer the best prediction capacities. This work improves the application value of enterprise credit risk prediction models in strategic emerging industries by embedding the mining of technological innovation text information.

Keywords

credit risks; ensemble learning; strategic emerging industries; text mining

Hrčak ID:

288405

URI

https://hrcak.srce.hr/288405

Publication date:

15.12.2022.

Visits: 1.298 *