Specious rules: an efficient and effective unifying method for removing   misleading and uninformative patterns in association rule mining

Wilhelmiina H\"am\"al\"ainen; Geoffrey I. Webb

arXiv:1709.03915·cs.AI·September 13, 2017

Specious rules: an efficient and effective unifying method for removing misleading and uninformative patterns in association rule mining

Wilhelmiina H\"am\"al\"ainen, Geoffrey I. Webb

PDF

TL;DR

This paper introduces a unifying method to identify and remove misleading, redundant association rules called specious rules, significantly improving the quality of association rule mining results.

Contribution

It provides a theoretical framework and practical algorithms for detecting and pruning specious rules, addressing a broad class of misleading associations in data mining.

Findings

01

Reduces the number of misleading association rules

02

Effective across multiple goodness measures

03

Improves the interpretability of mined rules

Abstract

We present theoretical analysis and a suite of tests and procedures for addressing a broad class of redundant and misleading association rules we call \emph{specious rules}. Specious dependencies, also known as \emph{spurious}, \emph{apparent}, or \emph{illusory associations}, refer to a well-known phenomenon where marginal dependencies are merely products of interactions with other variables and disappear when conditioned on those variables. The most extreme example is Yule-Simpson's paradox where two variables present positive dependence in the marginal contingency table but negative in all partial tables defined by different levels of a confounding factor. It is accepted wisdom that in data of any nontrivial dimensionality it is infeasible to control for all of the exponentially many possible confounds of this nature. In this paper, we consider the problem of specious dependencies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPruning