Technical gazette, Vol. 31 No. 2, 2024.
Preliminary communication
https://doi.org/10.17559/TV-20230719000815
A Parallel Mining Algorithm for Maximum Erasable Itemset Based on Multi-core Processor
Qunli Zhao
orcid.org/0000-0002-3036-7837
; School of Computer and Artificial Intelligence, Hefei Normal University, Hefei, China 230061
*
Hesheng Cheng
; School of Computer and Artificial Intelligence, Hefei Normal University, Hefei, China 230061
Chen Shen
; School of Computer and Artificial Intelligence, Hefei Normal University, Hefei, China 230061
* Corresponding author.
Abstract
Mining the erasable itemset is an interesting research domain, which has been applied to solve the problem of how to efficiently use limited funds to optimise production in economic crisis. After the problem of mining the erasable itemset was posed, researchers have proposed many algorithms to solve it, among which mining the maximum erasable itemset is a significant direction for research. Since all subsets of the maximum erasable itemset are erasable itemsets, all erasable itemsets can be obtained by mining the maximum erasable itemset, which reduces both the quantity of candidate and resultant itemsets generated during the mining process. However, computing many itemset values still takes a lot of CPU time when mining huge amounts of data. And it is difficult to solve the problem quickly with sequential algorithms. Therefore, this proposed study presents a parallel algorithm for the mining of maximum erasable itemsets, called PAMMEI, based on a multi-core processor platform. The algorithm divides the entire mining task into multiple subtasks and assigns them to multiple processor cores for parallel execution, while using an efficient pruning strategy to downsize the space to be searched and increase the mining speed. To verify the efficiency of the PAMMEI algorithm, the paper compares it with most advanced algorithms. The experimental results show that PAMMEI is superior to the comparable algorithms with respect to runtime, memory usage and scalability.
Keywords
data mining; erasable itemset; maximum erasable itemset; multi-core platform; product sets
Hrčak ID:
314855
URI
Publication date:
29.2.2024.
Visits: 856 *