hrcak mascot   Srce   HID

Izvorni znanstveni članak

A Generic Procedure for Integration Testing of ETL Procedures

Igor Mekterović ; Department of Applied Computing, Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000, Zagreb, Croatia
Ljiljana Brkić ; Department of Applied Computing, Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000, Zagreb, Croatia
Mirta Baranović ; Department of Applied Computing, Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000, Zagreb, Croatia

Puni tekst: engleski, pdf (1 MB) str. 169-178 preuzimanja: 3.471* citiraj
APA 6th Edition
Mekterović, I., Brkić, Lj. i Baranović, M. (2011). A Generic Procedure for Integration Testing of ETL Procedures. Automatika, 52 (2), 169-178. Preuzeto s https://hrcak.srce.hr/71300
MLA 8th Edition
Mekterović, Igor, et al. "A Generic Procedure for Integration Testing of ETL Procedures." Automatika, vol. 52, br. 2, 2011, str. 169-178. https://hrcak.srce.hr/71300. Citirano 21.10.2021.
Chicago 17th Edition
Mekterović, Igor, Ljiljana Brkić i Mirta Baranović. "A Generic Procedure for Integration Testing of ETL Procedures." Automatika 52, br. 2 (2011): 169-178. https://hrcak.srce.hr/71300
Harvard
Mekterović, I., Brkić, Lj., i Baranović, M. (2011). 'A Generic Procedure for Integration Testing of ETL Procedures', Automatika, 52(2), str. 169-178. Preuzeto s: https://hrcak.srce.hr/71300 (Datum pristupa: 21.10.2021.)
Vancouver
Mekterović I, Brkić Lj, Baranović M. A Generic Procedure for Integration Testing of ETL Procedures. Automatika [Internet]. 2011 [pristupljeno 21.10.2021.];52(2):169-178. Dostupno na: https://hrcak.srce.hr/71300
IEEE
I. Mekterović, Lj. Brkić i M. Baranović, "A Generic Procedure for Integration Testing of ETL Procedures", Automatika, vol.52, br. 2, str. 169-178, 2011. [Online]. Dostupno na: https://hrcak.srce.hr/71300. [Citirano: 21.10.2021.]

Sažetak
In order to attain a certain degree of confidence in the quality of the data in the data warehouse it is necessary to perform a series of tests. There are many components (and aspects) of the data warehouse that can be tested, and in this paper we focus on the ETL procedures. Due to the complexity of ETL process, ETL procedure tests are usually custom written, having a very low level of reusability. In this paper we address this issue and work towards establishing a generic procedure for integration testing of certain aspects of ETL procedures. In this approach, ETL procedures are treated as a black box and are tested by comparing their inputs and outputs – datasets. Datasets from three locations are compared: datasets from the relational source(s), datasets from the staging area and datasets from the data warehouse. Proposed procedure is generic and can be implemented on any data warehouse employing dimensional model and having relational database(s) as a source. Our work pertains only to certain aspects of data quality problems that can be found in DW systems. It provides a basic testing foundation or augments existing data warehouse system’s testing capabilities. We comment on proposed mechanisms both in terms of full reload and incremental loading.

Ključne riječi
Data quality; Data warehouse; Dimensional model; ETL testing

Hrčak ID: 71300

URI
https://hrcak.srce.hr/71300

[hrvatski]

Posjeta: 3.959 *