
Original scientific paper

https://doi.org/10.54820/PUCR5250

Novel Approach to Choosing Principal Components Number in Logistic Regression

Borislava Vrigazova (ORCID: orcid.org/0000-0001-9335-6927); Sofia University, Bulgaria



page 1-12



Abstract

The established approach to choosing the number of principal components for prediction models is to examine how much each principal component contributes to the total variance of the target variable. A combination of potentially important principal components is then chosen to explain a large share of the variance in the target. Sometimes several combinations of principal components must be explored to achieve the highest classification accuracy. This research proposes a novel automatic way of deciding how many principal components to retain in order to improve classification accuracy. We do so by combining principal components with ANOVA selection. To improve the accuracy resulting from our automatic approach, we use the bootstrap procedure for model selection. We call this procedure Bootstrapped-ANOVA PCA selection. Our results suggest that it can automate the selection of principal components and improve the accuracy of classification models, in our example logistic regression.
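The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not say how ANOVA ranks the components or how the bootstrap is applied, so the sketch assumes that components are ranked by the ANOVA F-statistic (scikit-learn's `f_classif`) and that each candidate number of components is scored by out-of-bag accuracy over bootstrap resamples. The helper names `bootstrap_accuracy` and `select_n_components`, the breast cancer dataset, the candidate range, and the number of resamples are all hypothetical choices made for illustration.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def bootstrap_accuracy(X, y, k, n_boot=20, seed=0):
    """Mean out-of-bag accuracy when the k components with the highest
    ANOVA F-statistic are kept (assumed scoring scheme, not the paper's)."""
    rng = np.random.RandomState(seed)
    n = len(y)
    scores = []
    for _ in range(n_boot):
        idx = rng.randint(0, n, size=n)           # bootstrap sample, with replacement
        oob = np.setdiff1d(np.arange(n), idx)     # rows left out of this sample
        if oob.size == 0 or np.unique(y[idx]).size < 2:
            continue                              # skip degenerate resamples
        model = make_pipeline(
            StandardScaler(),                     # PCA is scale-sensitive
            PCA(),                                # keep all components at this stage
            SelectKBest(f_classif, k=k),          # ANOVA F-test picks k components
            LogisticRegression(max_iter=1000),
        )
        model.fit(X[idx], y[idx])
        scores.append(model.score(X[oob], y[oob]))
    return float(np.mean(scores))


def select_n_components(X, y, candidates, n_boot=20, seed=0):
    """Return the candidate k with the best bootstrap accuracy, plus all scores."""
    results = {k: bootstrap_accuracy(X, y, k, n_boot, seed) for k in candidates}
    best_k = max(results, key=results.get)
    return best_k, results


# Illustrative data only; the abstract does not name the datasets used.
X, y = load_breast_cancer(return_X_y=True)
best_k, results = select_n_components(X, y, candidates=range(1, 11))
print(best_k, round(results[best_k], 3))
```

Fitting the scaler, PCA, and ANOVA selection inside the pipeline means they are re-estimated on every bootstrap sample, so the out-of-bag rows never influence which components are selected.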

Keywords

ANOVA; PCA; Bootstrap; logistic regression

Hrčak ID:

271527

URI

https://hrcak.srce.hr/271527

Publication date:

7.12.2021.
