Learning Rules-First Classifiers
Deborah Cohen, Amit Daniely, Amir Globerson, Gal Elidan

TL;DR
This paper introduces a new rule-based classifier that combines rule-based and linear models, providing a simple, efficient, and interpretable approach with near-optimal sample complexity, validated through experiments.
Contribution
It defines a novel hypothesis class for rule-based classifiers, analyzes its sample complexity, and presents an efficient algorithm with near-optimal sample complexity, supported by empirical results.
Findings
The proposed method achieves high accuracy on synthetic and sentiment analysis data.
It offers improved interpretability over traditional classifiers.
The algorithm's sample complexity is close to theoretical optimal among efficient algorithms.
Abstract
Complex classifiers may exhibit "embarassing" failures in cases where humans can easily provide a justified classification. Avoiding such failures is obviously of key importance. In this work, we focus on one such setting, where a label is perfectly predictable if the input contains certain features, or rules, and otherwise it is predictable by a linear classifier. We define a hypothesis class that captures this notion and determine its sample complexity. We also give evidence that efficient algorithms cannot achieve this sample complexity. We then derive a simple and efficient algorithm and show that its sample complexity is close to optimal, among efficient algorithms. Experiments on synthetic and sentiment analysis data demonstrate the efficacy of the method, both in terms of accuracy and interpretability.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Data Stream Mining Techniques · Machine Learning and Data Classification
