A Matrix Logic Approach to Efficient Frequent Itemset Discovery in Large Data Sets
Xuan Li, Tingyi Ruan, Yankaiqi Li, Quanchao Lu, Xiaoxuan Sun

TL;DR
This paper introduces a Boolean matrix-based frequent itemset mining algorithm that improves efficiency and scalability in large datasets by leveraging matrix operations instead of candidate generation.
Contribution
The proposed algorithm uses Boolean matrix operations for frequent itemset mining, reducing storage and computational bottlenecks in high-dimensional, large-scale transaction databases.
Findings
Efficiently mines frequent itemsets with low support thresholds.
Maintains good performance as transaction numbers increase.
Demonstrates scalability and robustness in large datasets.
Abstract
This paper proposes a frequent itemset mining algorithm based on the Boolean matrix method, aiming to solve the storage and computational bottlenecks of traditional frequent pattern mining algorithms in high-dimensional and large-scale transaction databases. By representing the itemsets in the transaction database as Boolean matrices, the algorithm uses Boolean logic operations such as AND and OR to efficiently calculate the support of the itemsets, avoiding the generation and storage of a large number of candidates itemsets in traditional algorithms. The algorithm recursively mines frequent itemsets through matrix operations and can flexibly adapt to different data scales and support thresholds. In the experiment, the public Groceries dataset was selected, and the running efficiency test and frequent itemset mining effect test were designed to evaluate the algorithm's performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Bayesian Modeling and Causal Inference · Data Management and Algorithms
