A Comparative Study of Feature Selection in Tsetlin Machines
Vojtech Halenka, Ole-Christoffer Granmo, Lei Jiao, and Per-Arne Andersen

TL;DR
This paper evaluates various feature selection methods for Tsetlin Machines, demonstrating that TM-internal scorers are effective and computationally efficient, and establishing a baseline for interpretability in TM models.
Contribution
It adapts and benchmarks multiple feature selection techniques for Tsetlin Machines, including novel TM-specific scorers, and assesses their effectiveness across diverse datasets.
Findings
TM-internal scorers perform competitively in feature importance estimation.
Simpler TM-specific scorers retain accuracy with lower computational cost.
The study provides the first comprehensive baseline for feature selection in Tsetlin Machines.
Abstract
Feature Selection (FS) is crucial for improving model interpretability, reducing complexity, and sometimes for enhancing accuracy. The recently introduced Tsetlin machine (TM) offers interpretable clause-based learning, but lacks established tools for estimating feature importance. In this paper, we adapt and evaluate a range of FS techniques for TMs, including classical filter and embedded methods as well as post-hoc explanation methods originally developed for neural networks (e.g., SHAP and LIME) and a novel family of embedded scorers derived from TM clause weights and Tsetlin automaton (TA) states. We benchmark all methods across 12 datasets, using evaluation protocols, like Remove and Retrain (ROAR) strategy and Remove and Debias (ROAD), to assess causal impact. Our results show that TM-internal scorers not only perform competitively but also exploit the interpretability of clauses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Evolutionary Algorithms and Applications
