Original scientific paper
https://doi.org/10.24138/jcomss.v8i4.164
Automated Clustering of Virtual Machines based on Correlation of Resource Usage
Claudia Canali
; University of Modena and Reggio Emilia, Department of Engineering “Enzo Ferrari”
Riccardo Lancellotti
orcid.org/0000-0002-9470-8784
; University of Modena and Reggio Emilia, Department of Engineering “Enzo Ferrari”
Abstract
The recent growth in demand for modern applications combined with the shift to the Cloud computing paradigm have led to the establishment of large-scale cloud data centers. The increasing size of these infrastructures represents a major challenge in terms of monitoring and management of the system resources. Available solutions typically consider every Virtual Machine (VM) as a black box each with independent characteristics, and face scalability issues by reducing the number of monitored resource samples, considering in most cases only average CPU usage sampled at a coarse time granularity. We claim that scalability issues can be addressed by leveraging the similarity between VMs in terms of resource usage patterns. In this paper we propose an automated methodology to cluster VMs depending on the usage of multiple resources, both systemand network-related, assuming no knowledge of the services executed on them. This is an innovative methodology that exploits the correlation between the resource usage to cluster together similar VMs. We evaluate the methodology through a case study with data coming from an enterprise datacenter, and we show that high performance may be achieved in automatic VMs clustering. Furthermore, we estimate the reduction in the amount of data collected, thus showing that our proposal may simplify the monitoring requirements and help administrators to take decisions on the resource management of cloud computing datacenters.
Keywords
Cloud computing; VM Clustering; k-means; Correlation analysis
Hrčak ID:
180217
URI
Publication date:
21.12.2012.
Visits: 1.119 *