hrcak mascot   Srce   HID

Izvorni znanstveni članak
https://doi.org/10.17559/TV-20200918143701

Clustering Algorithm Based on Sparse Feature Vector without Specifying Parameter

Huixia He ; School of Economics and Management, University of Science and Technology Beijing, No. 30 Xueyuan Road, Haidian District, Beijing, China
Guiying Wei ; School of Economics and Management, University of Science and Technology Beijing, No. 30 Xueyuan Road, Haidian District, Beijing, China
Sen Wu* ; School of Economics and Management, University of Science and Technology Beijing, No. 30 Xueyuan Road, Haidian District, Beijing, China
Xiaonan Gao ; School of Economics and Management, University of Science and Technology Beijing, No. 30 Xueyuan Road, Haidian District, Beijing, China

Puni tekst: engleski, pdf (585 KB) str. 1974-1981 preuzimanja: 55* citiraj
APA 6th Edition
He, H., Wei, G., Wu*, S. i Gao, X. (2020). Clustering Algorithm Based on Sparse Feature Vector without Specifying Parameter. Tehnički vjesnik, 27 (6), 1974-1981. https://doi.org/10.17559/TV-20200918143701
MLA 8th Edition
He, Huixia, et al. "Clustering Algorithm Based on Sparse Feature Vector without Specifying Parameter." Tehnički vjesnik, vol. 27, br. 6, 2020, str. 1974-1981. https://doi.org/10.17559/TV-20200918143701. Citirano 18.01.2021.
Chicago 17th Edition
He, Huixia, Guiying Wei, Sen Wu* i Xiaonan Gao. "Clustering Algorithm Based on Sparse Feature Vector without Specifying Parameter." Tehnički vjesnik 27, br. 6 (2020): 1974-1981. https://doi.org/10.17559/TV-20200918143701
Harvard
He, H., et al. (2020). 'Clustering Algorithm Based on Sparse Feature Vector without Specifying Parameter', Tehnički vjesnik, 27(6), str. 1974-1981. https://doi.org/10.17559/TV-20200918143701
Vancouver
He H, Wei G, Wu* S, Gao X. Clustering Algorithm Based on Sparse Feature Vector without Specifying Parameter. Tehnički vjesnik [Internet]. 2020 [pristupljeno 18.01.2021.];27(6):1974-1981. https://doi.org/10.17559/TV-20200918143701
IEEE
H. He, G. Wei, S. Wu* i X. Gao, "Clustering Algorithm Based on Sparse Feature Vector without Specifying Parameter", Tehnički vjesnik, vol.27, br. 6, str. 1974-1981, 2020. [Online]. https://doi.org/10.17559/TV-20200918143701

Sažetak
Parameter setting is an essential factor affecting algorithm performance in data mining techniques. CABOSFV is an efficient clustering algorithm which can cluster binary data with sparse features, but it is challenging to specify the threshold parameter. To solve the difficulty of parameter decision, a clustering algorithm based on sparse feature vector without specifying parameter (CASP) is proposed in this paper. The calculation method of an upper limit of threshold is firstly defined to determine the range of threshold. Furthermore, we use the sparseness index to sort the data and conduct the clustering process based on the adjusted sparse feature vector after data sorting. An interval search strategy is adopted to find a suitable threshold within the defined threshold range, and the clustering result with the selected suitable parameter is the outcome. Experiments on 7 UCI datasets demonstrate that the clustering results of the CASP algorithm are superior to other baselines in terms of both effectiveness and efficiency. CASP not only simplifies the parameter decision process, but also obtains desirable clustering results quickly and stably, which shows the practicability of the algorithm.

Ključne riječi
CABOSFV; clustering; sparse feature; threshold parameter

Hrčak ID: 248249

URI
https://hrcak.srce.hr/248249

Posjeta: 96 *