Delta-Closure Structure for Studying Data Distribution
Aleksey Buzmakov, Tatiana Makhalova, Sergei O. Kuznetsov, Amedeo, Napoli

TL;DR
This paper introduces the $\Delta$-closure structure, a novel framework for analyzing data distribution in binary datasets through generalized closure operators, revealing stable attribute correlations.
Contribution
It proposes $\Delta$-closedness and the $\Delta$-closure structure, providing new tools to characterize and interpret data distributions and attribute correlations.
Findings
The $\Delta$-closure structure is stable across different datasets and sampling methods.
High $\Delta$-classes indicate strong attribute correlations supported by many observations.
The framework generalizes traditional closure operators to better understand data distribution.
Abstract
In this paper, we revisit pattern mining and study the distribution underlying a binary dataset thanks to the closure structure which is based on passkeys, i.e., minimum generators in equivalence classes robust to noise. We introduce -closedness, a generalization of the closure operator, where measures how a closed set differs from its upper neighbors in the partial order induced by closure. A -class of equivalence includes minimum and maximum elements and allows us to characterize the distribution underlying the data. Moreover, the set of -classes of equivalence can be partitioned into the so-called -closure structure. In particular, a -class of equivalence with a high level demonstrates correlations among many attributes, which are supported by more observations when is large. In the experiments, we study the -closure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Data Mining Algorithms and Applications · Imbalanced Data Classification Techniques
