Skip to the main content

Original scientific paper

https://doi.org/10.17559/TV-20220421110027

A Cross-project Defect Prediction Model Using Feature Transfer and Ensemble Learning

Fuping Zeng ; School of Reliability and Systems Engineering, Beihang University, Beijing100191, China
Wanting Lin ; School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Ying Xing ; School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Lu Sun ; School of Reliability and Systems Engineering, Beihang University, Beijing100191, China
Bin Yang ; Du Xiaoman (Beijing) Science Technology Co., Ltd., Beijing 100094, China


Full text: english pdf 1.128 Kb

page 1089-1099

downloads: 567

cite


Abstract

Cross-project defect prediction (CPDP) trains the prediction models with existing data from other projects (the source projects) and uses the trained model to predict the target projects. To solve two major problems in CPDP, namely, variability in data distribution and class imbalance, in this paper we raise a CPDP model combining feature transfer and ensemble learning, with two stages of feature transfer and the classification. The feature transfer method is based on Pearson correlation coefficient, which reduces the dimension of feature space and the difference of feature distribution between items. The class imbalance is solved by SMOTE and Voting on both algorithm and data levels. The experimental results on 20 source-target projects show that our method can yield significant improvement on CPDP.

Keywords

cross-project defect prediction; ensemble learning; machine learning; transfer learning

Hrčak ID:

279430

URI

https://hrcak.srce.hr/279430

Publication date:

17.6.2022.

Visits: 1.314 *