Interpretable DNFs
Martin C. Cooper, Imane Bousdira, Cl\'ement Carbonnel

TL;DR
This paper investigates interpretable DNF formulas, focusing on models where both the formula and its negation are expressible as small DNFs, and compares nested k-DNFs with decision trees for interpretability and accuracy.
Contribution
It introduces nested k-DNFs as a new family of models and compares their interpretability and accuracy to depth-k decision trees.
Findings
Nested k-DNFs are a promising alternative to decision trees.
Experiments show comparable interpretability and accuracy.
Both models facilitate explanations of positive and negative decisions.
Abstract
A classifier is considered interpretable if each of its decisions has an explanation which is small enough to be easily understood by a human user. A DNF formula can be seen as a binary classifier over boolean domains. The size of an explanation of a positive decision taken by a DNF is bounded by the size of the terms in , since we can explain a positive decision by giving a term of that evaluates to true. Since both positive and negative decisions must be explained, we consider that interpretable DNFs are those for which both and can be expressed as DNFs composed of terms of bounded size. In this paper, we study the family of -DNFs whose complements can also be expressed as -DNFs. We compare two such families, namely depth- decision trees and nested -DNFs, a novel family of models. Experiments indicate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Adversarial Robustness in Machine Learning
