Skip to the main content

Original scientific paper

https://doi.org/10.2478/crebss-2022-0002

ANOVA bootstrapped principal components analysis for logistic regression

Borislava Toleva ; Sofia University “St Kliment Ohridski”, Bulgaria


Full text: english pdf 240 Kb

page 18-31

downloads: 442

cite


Abstract

Principal components analysis (PCA) is often used as a dimensionality reduction technique. A small number of principal components is selected to be used in a classification or a regression model to boost accuracy. A central issue in the PCA is how to select the number of principal components. Existing algorithms often result in contradictions and the researcher needs to manually select the final number of principal components to be used. In this research the author proposes a novel algorithm that automatically selects the number of principal components. This is achieved based on a combination of ANOVA ranking of principal components, the bootstrap and classification models. Unlike the classical approach, the algorithm we propose improves the accuracy of the logistic regression and selects the best combination of principal components that may not necessarily be ordered. The ANOVA bootstrapped PCA classification we propose is novel as it automatically selects the number of principal components that would maximise the accuracy of the classification model.

Keywords

ANOVA; bootstrap; classification; logistic regression

Hrčak ID:

279335

URI

https://hrcak.srce.hr/279335

Publication date:

20.6.2022.

Visits: 942 *