Skip to the main content

Original scientific paper

https://doi.org/10.17559/TV-20200828055024

Understanding the Evaluation Abilities of External Cluster Validity Indices to Internal Ones

Xiaonan Gao ; School of Economics and Management, University of Science and Technology Beijing, 30 Xueyuan Road, Haidian District, Beijing 100083, China
Guiying Wei ; School of Economics and Management, University of Science and Technology Beijing, 30 Xueyuan Road, Haidian District, Beijing 100083, China
Sen Wu* ; School of Economics and Management, University of Science and Technology Beijing, 30 Xueyuan Road, Haidian District, Beijing 100083, China
Falong Fan ; School of Economics and Management, University of Science and Technology Beijing, 30 Xueyuan Road, Haidian District, Beijing 100083, China


Full text: english pdf 651 Kb

page 1956-1964

downloads: 527

cite


Abstract

Evaluating internal Cluster Validity Index (CVI) is a critical task in clustering research. Existing studies mainly employ the number of clusters (NC-based method) or external CVIs (external CVIs-based method) to evaluate internal CVIs, which are not always reasonable in all scenarios. Additionally, there is no guideline of choosing appropriate methods to evaluate internal CVIs in different cases. In this paper, we focus on the evaluation abilities of external CVIs to internal CVIs, and propose a novel approach, named external CVI's evaluation Ability MEasurement approach through Ranking consistency (CAMER), to measure the evaluation abilities of external CVIs quantitatively, for assisting in selecting appropriate external CVIs to evaluate internal CVIs. Specifically, we formulate the evaluation ability measurement problem as a ranking consistency task, by measuring the consistency between the evaluation results of external CVIs to internal CVIs and the ground truth performance of internal CVIs. Then, the superiority of CAMER is validated through a real-world case. Moreover, the evaluation abilities of seven popular external CVIs to internal CVIs in six different scenarios are explored by CAMER. Finally, these explored evaluation abilities are validated on four real-world datasets, demonstrating the effectiveness of CAMER.

Keywords

cluster validity indices (CVIs); evaluation ability; quantitative measurement; ranking consistency

Hrčak ID:

248247

URI

https://hrcak.srce.hr/248247

Publication date:

19.12.2020.

Visits: 1.263 *