A Simple Density with Distance Based Initial Seed Selection Technique for K Means Algorithm

Syed Azimuddin, Sajidha; Desikan, Kalyani

doi:10.20532/cit.2017.1003605

Journal of computing and information technology, Vol. 25 No. 4, 2017.

Original scientific paper

https://doi.org/10.20532/cit.2017.1003605

A Simple Density with Distance Based Initial Seed Selection Technique for K Means Algorithm

Sajidha Syed Azimuddin orcid.org/0000-0003-4771-3131 ; School of Computing Science and Engineering, VIT, Chennai, India
Kalyani Desikan orcid.org/0000-0002-3074-5826 ; Department of mathematics, School of Advanced Sciences, VIT, Chennai, India

Full text: english pdf 379 Kb

page 291-300

downloads: 895

cite

APA 6th Edition

Syed Azimuddin, S. & Desikan, K. (2017). A Simple Density with Distance Based Initial Seed Selection Technique for K Means Algorithm. Journal of computing and information technology, 25 (4), 291-300. https://doi.org/10.20532/cit.2017.1003605

MLA 8th Edition

Syed Azimuddin, Sajidha and Kalyani Desikan. "A Simple Density with Distance Based Initial Seed Selection Technique for K Means Algorithm." Journal of computing and information technology, vol. 25, no. 4, 2017, pp. 291-300. https://doi.org/10.20532/cit.2017.1003605. Accessed 5 Jan. 2025.

Chicago 17th Edition

Syed Azimuddin, Sajidha and Kalyani Desikan. "A Simple Density with Distance Based Initial Seed Selection Technique for K Means Algorithm." Journal of computing and information technology 25, no. 4 (2017): 291-300. https://doi.org/10.20532/cit.2017.1003605

Harvard

Syed Azimuddin, S., and Desikan, K. (2017). 'A Simple Density with Distance Based Initial Seed Selection Technique for K Means Algorithm', Journal of computing and information technology, 25(4), pp. 291-300. https://doi.org/10.20532/cit.2017.1003605

Vancouver

Syed Azimuddin S, Desikan K. A Simple Density with Distance Based Initial Seed Selection Technique for K Means Algorithm. Journal of computing and information technology [Internet]. 2017 [cited 2025 January 05];25(4):291-300. https://doi.org/10.20532/cit.2017.1003605

IEEE

S. Syed Azimuddin and K. Desikan, "A Simple Density with Distance Based Initial Seed Selection Technique for K Means Algorithm", Journal of computing and information technology, vol.25, no. 4, pp. 291-300, 2017. [Online]. https://doi.org/10.20532/cit.2017.1003605

Abstract

Open issues with respect to K means algorithm are identifying the number of clusters, initial seed concept selection, clustering tendency, handling empty clusters, identifying outliers etc. In this paper we propose a novel and a simple technique considering both density and distance of the concepts in a dataset to identify initial seed concepts for clustering. Many authors have proposed different techniques to identify initial seed concepts; but our method ensures that the initial seed concepts are chosen from different clusters that are to be generated by the clustering solution. The hallmark of our algorithm is that it is a single pass algorithm that does not require any extra parameters to be estimated. Further, our seed concepts are one among the actual concepts and not the mean of representative concepts as is the case in many other algorithms. We have implemented our proposed algorithm and compared the results with the interval based technique of Fouad Khan. We see that our method outperforms the interval based method. We have also compared our method with the original random K means and K Means++ algorithms.

Keywords

Computer science; Information Systems

Hrčak ID:

192040

URI

https://hrcak.srce.hr/192040

Publication date:

5.1.2018.

Visits: 1.736 *

Login and registration

Journal of computing and information technology, Vol. 25 No. 4, 2017.

Abstract

Keywords

Hrčak ID:

URI

Publication date: