MINING TOP-K CORRELATED HIGH UTILITY ITEMSETS ON TRANSACTION DATABASE
Corresponding Author(s) : Manh Thien Ly
HUIT Journal of Science,
Vol. 23 No. 1 (2023)
Abstract
In this paper, we propose a new research direction, which is to exploit the top-k Correlated High Utility Itemset (TCHUI) on the transaction database. Combination of mining Correlated High Utility Itemsets and mining Top-K High Utility Itemsets to find the top-k correlated high utility itemsets on the transaction database. To address this issue, we combine the Correlated High Utility Itemset (CoHUI) mining with the threshold raising strategies and propose the TCH algorithm. This algorithm uses the Utility List data structure to store data about the utility of itemsets, uses the Kulc threshold to measure correlation, and applies several pruning strategies such as U-Prune, TWU -Prune, LA-Prune to reduce the search space. Besides that, the threshold raising strategies are applied such as RIU, LIU-E, and RUC to exploit the TCHUI set effectively. Our experiment results on large datasets including Chess, Mushroom, Retail, and Chainstore and compare with the state-of-the-art THUI algorithm. The results show that the proposed algorithm has better performance than the THUI algorithm in terms of execution time and memory usage.
Keywords
Download Citation
Endnote/Zotero/Mendeley (RIS)BibTeX