
Original scientific paper
https://doi.org/10.17559/TV-20200520034015

Data Deduplication Technology for Cloud Storage

Qinlu He ; School of Information and Control Engineering, Xi'an University of Architecture and Technology, Xi'an 710043, China
Genqing Bian* ; School of Information and Control Engineering, Xi'an University of Architecture and Technology, Xi'an 710043, China
Bilin Shao ; School of Management, Xi'an University of Architecture and Technology, Xi'an 710043, China
Weiqi Zhang ; School of Information and Control Engineering, Xi'an University of Architecture and Technology, Xi'an 710043, China

Full text: English, pdf (861 KB), pp. 1444-1451
APA 6th Edition
He, Q., Bian, G., Shao, B. & Zhang, W. (2020). Data Deduplication Technology for Cloud Storage. Tehnički vjesnik, 27 (5), 1444-1451. https://doi.org/10.17559/TV-20200520034015
MLA 8th Edition
He, Qinlu, et al. "Data Deduplication Technology for Cloud Storage." Tehnički vjesnik, vol. 27, no. 5, 2020, pp. 1444-1451. https://doi.org/10.17559/TV-20200520034015. Accessed 27.10.2020.
Chicago 17th Edition
He, Qinlu, Genqing Bian, Bilin Shao and Weiqi Zhang. "Data Deduplication Technology for Cloud Storage." Tehnički vjesnik 27, no. 5 (2020): 1444-1451. https://doi.org/10.17559/TV-20200520034015
Harvard
He, Q., et al. (2020). 'Data Deduplication Technology for Cloud Storage', Tehnički vjesnik, 27(5), pp. 1444-1451. https://doi.org/10.17559/TV-20200520034015
Vancouver
He Q, Bian G, Shao B, Zhang W. Data Deduplication Technology for Cloud Storage. Tehnički vjesnik [Internet]. 2020 [cited 27.10.2020];27(5):1444-1451. https://doi.org/10.17559/TV-20200520034015
IEEE
Q. He, G. Bian, B. Shao and W. Zhang, "Data Deduplication Technology for Cloud Storage", Tehnički vjesnik, vol. 27, no. 5, pp. 1444-1451, 2020. [Online]. https://doi.org/10.17559/TV-20200520034015

Abstract
With the explosive growth of data, storage systems have entered the cloud storage era. Although the distributed file system at the core of a cloud storage system solves the problem of mass data storage, a large amount of duplicate data exists in every storage system. File systems are designed to control how files are stored and retrieved. Few studies focus on deduplication technologies for cloud file systems at the application level, especially for the Hadoop Distributed File System (HDFS). In this paper, we design a file deduplication framework on HDFS for cloud application developers. The two proposed data deduplication solutions, RFD-HDFS and FD-HDFS, perform deduplication online, which improves storage space utilisation and reduces redundancy. At the end of the paper, we test disk utilisation and file upload performance on RFD-HDFS and FD-HDFS, and compare the disk utilisation of the two frameworks with that of plain HDFS. The results show that both frameworks not only implement data deduplication but also effectively reduce the disk space consumed by duplicate files. The proposed framework can therefore reduce storage space by eliminating redundant HDFS files.
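
To illustrate the general idea of online, file-level deduplication mentioned in the abstract, the following is a minimal sketch only. It is not the RFD-HDFS or FD-HDFS design, whose details are not given on this page. It assumes a SHA-256 content fingerprint and an in-memory dictionary standing in for the file system; the class `DedupStore` and its methods are hypothetical names introduced for illustration.

```python
import hashlib


class DedupStore:
    """Toy in-memory store (hypothetical): one copy per distinct content."""

    def __init__(self):
        self.blobs = {}   # fingerprint -> file content, stored once
        self.names = {}   # logical file name -> fingerprint

    def upload(self, name, data):
        """Store data under name; return True only if new content was written."""
        # Assumption: SHA-256 of the whole file serves as its fingerprint.
        fingerprint = hashlib.sha256(data).hexdigest()
        is_new = fingerprint not in self.blobs
        if is_new:
            self.blobs[fingerprint] = data
        # Duplicate uploads only add a name entry pointing at existing content.
        self.names[name] = fingerprint
        return is_new

    def read(self, name):
        """Return the content a logical file name refers to."""
        return self.blobs[self.names[name]]

    def stored_bytes(self):
        """Total bytes physically kept, after deduplication."""
        return sum(len(blob) for blob in self.blobs.values())


store = DedupStore()
first = store.upload("a.txt", b"hello cloud")    # new content
second = store.upload("b.txt", b"hello cloud")   # duplicate of a.txt
third = store.upload("c.txt", b"other data")     # new content
```

Because the duplicate check happens during upload, redundant content never reaches storage, which is what makes the approach online rather than a later clean-up pass. This sketch ignores deletion, reference counting, and hash collisions, all of which a real system would need to handle.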

Keywords
cloud storage; data deduplication; distributed; file deletion; HDFS

Hrčak ID: 244744

URI
https://hrcak.srce.hr/244744
