Scalable Rule-Based Representation Learning for Interpretable Classification
Zhuo Wang, Wei Zhang, Ning Liu, Jianyong Wang

TL;DR
This paper introduces RRL, a scalable and interpretable rule-based classifier that uses a novel Gradient Grafting training method to optimize non-differentiable rules directly, balancing accuracy and interpretability.
Contribution
The paper proposes RRL, a new rule-based classifier with a gradient-based training method for non-differentiable rules, enhancing scalability and interpretability.
Findings
RRL outperforms existing interpretable models on multiple datasets.
RRL effectively balances accuracy and model complexity.
The method scales well to large datasets.
Abstract
Rule-based models, e.g., decision trees, are widely used in scenarios demanding high model interpretability for their transparent inner structures and good model expressivity. However, rule-based models are hard to optimize, especially on large data sets, due to their discrete parameters and structures. Ensemble methods and fuzzy/soft rules are commonly used to improve performance, but they sacrifice the model interpretability. To obtain both good scalability and interpretability, we propose a new classifier, named Rule-based Representation Learner (RRL), that automatically learns interpretable non-fuzzy rules for data representation and classification. To train the non-differentiable RRL effectively, we project it to a continuous space and propose a novel training method, called Gradient Grafting, that can directly optimize the discrete model using gradient descent. An improved design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Data Stream Mining Techniques
