Skip to the main content

Preliminary communication

https://doi.org/10.17559/TV-20230719000815

A Parallel Mining Algorithm for Maximum Erasable Itemset Based on Multi-core Processor

Qunli Zhao orcid id orcid.org/0000-0002-3036-7837 ; School of Computer and Artificial Intelligence, Hefei Normal University, Hefei, China 230061 *
Hesheng Cheng ; School of Computer and Artificial Intelligence, Hefei Normal University, Hefei, China 230061
Chen Shen ; School of Computer and Artificial Intelligence, Hefei Normal University, Hefei, China 230061

* Corresponding author.


Full text: english pdf 1.175 Kb

page 618-627

downloads: 329

cite


Abstract

Mining the erasable itemset is an interesting research domain, which has been applied to solve the problem of how to efficiently use limited funds to optimise production in economic crisis. After the problem of mining the erasable itemset was posed, researchers have proposed many algorithms to solve it, among which mining the maximum erasable itemset is a significant direction for research. Since all subsets of the maximum erasable itemset are erasable itemsets, all erasable itemsets can be obtained by mining the maximum erasable itemset, which reduces both the quantity of candidate and resultant itemsets generated during the mining process. However, computing many itemset values still takes a lot of CPU time when mining huge amounts of data. And it is difficult to solve the problem quickly with sequential algorithms. Therefore, this proposed study presents a parallel algorithm for the mining of maximum erasable itemsets, called PAMMEI, based on a multi-core processor platform. The algorithm divides the entire mining task into multiple subtasks and assigns them to multiple processor cores for parallel execution, while using an efficient pruning strategy to downsize the space to be searched and increase the mining speed. To verify the efficiency of the PAMMEI algorithm, the paper compares it with most advanced algorithms. The experimental results show that PAMMEI is superior to the comparable algorithms with respect to runtime, memory usage and scalability.

Keywords

data mining; erasable itemset; maximum erasable itemset; multi-core platform; product sets

Hrčak ID:

314855

URI

https://hrcak.srce.hr/314855

Publication date:

29.2.2024.

Visits: 856 *