Formal and Computational Properties of the Confidence Boost of Association Rules
Jos\'e L. Balc\'azar

TL;DR
This paper introduces the confidence boost measure for association rules, enhancing the detection of interesting rules by quantifying their novelty relative to others, and provides algorithms and an open-source tool for its practical application.
Contribution
It proposes a new measure called confidence boost to evaluate rule novelty, along with efficient algorithms and an open-source tool for its implementation.
Findings
Confidence boost effectively identifies novel and interesting association rules.
The measure helps eliminate rules of negative correlation that pass traditional confidence thresholds.
Experimental results on benchmark datasets demonstrate the utility of confidence boost in rule mining.
Abstract
Some existing notions of redundancy among association rules allow for a logical-style characterization and lead to irredundant bases of absolutely minimum size. One can push the intuition of redundancy further and find an intuitive notion of interest of an association rule, in terms of its "novelty" with respect to other rules. Namely: an irredundant rule is so because its confidence is higher than what the rest of the rules would suggest; then, one can ask: how much higher? We propose to measure such a sort of "novelty" through the confidence boost of a rule, which encompasses two previous similar notions (confidence width and rule blocking, of which the latter is closely related to the earlier measure "improvement"). Acting as a complement to confidence and support, the confidence boost helps to obtain small and crisp sets of mined association rules, and solves the well-known problem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Natural Language Processing Techniques
