Professional paper
Clustering with Open Source Tools
VLADIMIR ŠPIŠIĆ; KARLOVAC UNIVERSITY OF APPLIED SCIENCES
IVAN ŠTEDUL; KARLOVAC UNIVERSITY OF APPLIED SCIENCES
VEDRAN VYROUBAL (orcid.org/0000-0001-8876-1768); KARLOVAC UNIVERSITY OF APPLIED SCIENCES
Abstract
Data processing is one of the key steps in most research projects. In most cases the structure of the data to be processed is not known in advance, so during analysis the research data must be grouped into clusters from which conclusions can be derived. Today a large number of methods and a diverse set of software tools are used for data clustering. Many of these tools are open source, and in many cases their quality surpasses that of commercial software solutions. This paper provides an overview of one of the most widely used methods for hierarchical data clustering, as well as an overview of open source software tools that implement the aforementioned method (e.g. CLUTO, R). Some of the tools are implemented as standalone applications, while others are implemented as libraries that can be easily invoked from within another programming language's development environment.
Hrčak ID: 87439
Publication date: 20.8.2012.