Fast Algorithms for Mining Interesting Frequent Itemsets without Minimum Support

Shariq Bashir; Zahoor Jan; Abdul Rauf Baig

arXiv:0904.3319·cs.DB·April 22, 2009·4 cites

TL;DR

This paper introduces two algorithms, N-MostMiner and Top-K-Miner, for mining the N-most/Top-K interesting frequent itemsets without a user-defined minimum support threshold, and reports lower processing time than the existing BOMO and TFP methods.

Contribution

The paper proposes novel bit-vector based algorithms for top-K frequent itemset mining without support thresholds, enhancing efficiency and scalability.
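To make the general idea concrete, here is a minimal Python sketch of top-K itemset mining over bit-vectors. It is not the paper's N-MostMiner or Top-K-Miner; it is a generic depth-first search written under these assumptions: each item's bit-vector is a Python integer whose bit t is set when transaction t contains the item, a size-K min-heap holds the best itemsets found so far, and the smallest support in that heap serves as a pruning bound that rises during the search. The function names `build_bit_vectors` and `top_k_itemsets` are hypothetical, and ties in support are broken arbitrarily by search order.

```python
import heapq


def build_bit_vectors(transactions):
    """Map each item to an int whose bit t is set when transaction t contains it."""
    vectors = {}
    for tid, transaction in enumerate(transactions):
        for item in transaction:
            vectors[item] = vectors.get(item, 0) | (1 << tid)
    return vectors


def popcount(vector):
    """Number of set bits, i.e. the support of the itemset the vector represents."""
    return bin(vector).count("1")


def top_k_itemsets(transactions, k):
    """Return up to k (support, itemset) pairs with the highest support.

    Illustrative sketch only: no user-supplied support threshold is needed,
    because the k-th best support seen so far acts as the pruning bound.
    """
    if k <= 0:
        return []
    vectors = build_bit_vectors(transactions)
    # Visit high-support items first so the pruning bound rises quickly.
    items = sorted(vectors, key=lambda i: (-popcount(vectors[i]), i))
    heap = []  # min-heap of (support, itemset); heap[0] is the weakest entry

    def extend(prefix, prefix_vector, start):
        for idx in range(start, len(items)):
            vector = prefix_vector & vectors[items[idx]]  # bitwise AND = intersect
            support = popcount(vector)
            # Once the heap is full, only a strictly higher support can enter it.
            min_needed = heap[0][0] + 1 if len(heap) == k else 1
            if support < min_needed:
                continue  # supersets cannot have higher support, so prune
            itemset = prefix + (items[idx],)
            if len(heap) < k:
                heapq.heappush(heap, (support, itemset))
            else:
                heapq.heapreplace(heap, (support, itemset))
            extend(itemset, vector, idx + 1)

    all_transactions = (1 << len(transactions)) - 1
    extend((), all_transactions, 0)
    return sorted(heap, key=lambda entry: (-entry[0], entry[1]))


# Toy data (made up for illustration).
transactions = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"a"}, {"b", "c"}]
print(top_k_itemsets(transactions, 3))
# → [(4, ('a',)), (3, ('b',)), (3, ('c',))]
```

Pruning is safe because support is anti-monotone: extending an itemset can only clear bits in its vector, so a branch whose support is already below the bound cannot produce a qualifying superset.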

Findings

01

N-MostMiner and Top-K-Miner outperform BOMO and TFP in processing time.

02

Algorithms effectively handle sparse, large, and dirty datasets.

03

Experimental results validate improved efficiency of the proposed methods.

Abstract

Real-world datasets are sparse, dirty, and contain hundreds of items. In such situations, discovering interesting rules (results) with the traditional frequent itemset mining approach, which requires a user-defined input support threshold, is not appropriate: without domain knowledge, setting the support threshold too high can output nothing, while setting it too low can output a large number of redundant, uninteresting results. Recently, a novel approach of mining only the N-most/Top-K interesting frequent itemsets has been proposed, which discovers the top N interesting results without any user-defined support threshold. However, mining interesting frequent itemsets without a minimum support threshold is more costly in terms of itemset search space exploration and processing cost. The efficiency of such mining therefore depends heavily on three main factors: (1) the database representation approach used for…

Taxonomy

Topics: Data Mining Algorithms and Applications · Data Management and Algorithms · Rough Sets and Fuzzy Logic