Technical gazette, Vol. 29 No. 1, 2022.
Professional paper
https://doi.org/10.17559/TV-20200914224607
A Comparison of Query Execution Speeds for Large Amounts of Data Using Various DBMS Engines Executing on Selected RAM and CPU Configurations
Slaviša Ilić
; University Singidunum, Danijelova 32, 11000 Belgrade, Serbia
Siniša Ilić
; Faculty of Technical Sciences, Knjaza Miloša 7, 38220 Kosovska Mitrovica
Ivan Milovanović
; University Singidunum, Danijelova 32, 11000 Belgrade, Serbia
Petar Spalević*
; Faculty of Technical Sciences, Knjaza Miloša 7, 38220 Kosovska Mitrovica
Dragiša Miljković
; Faculty of Technical Sciences, Knjaza Miloša 7, 38220 Kosovska Mitrovica
Abstract
In modern economies, most important business decisions are based on detailed analysis of available data. In order to obtain a rapid response from analytical tools, data should be pre-aggregated over dimensions that are of most interest to each business. Sometimes however, important decisions may require analysis of business data over seemingly less important dimensions which have not been pre-aggregated during the ETL process. On these occasions, the ad-hoc "online" aggregation is performed whose execution time is dependent on the overall DBMS performance. This paper describes how the performance of several commercial and non-commercial DBMSs was tested by running queries designed for data analysis using "ad-hoc" aggregations over large volumes of data. Each DBMS was installed on a separate virtual machine and was run on several computers, and two amounts of RAM memory were allocated for each test. Measurements of query execution times were recorded which demonstrated that, as expected, column-oriented databases out-performed classical row-oriented database systems.
Keywords
data volume performance impact; hardware contribution to DBMS performance; on-line data aggregation; row-versus column oriented DBMS
Hrčak ID:
269609
URI
Publication date:
15.2.2022.
Visits: 1.714 *