Improving Interpretability and Accuracy in Neuro-Symbolic Rule   Extraction Using Class-Specific Sparse Filters

Parth Padalkar; Jaeseong Lee; Shiyi Wei; Gopal Gupta

arXiv:2501.16677·cs.CV·January 29, 2025

Improving Interpretability and Accuracy in Neuro-Symbolic Rule Extraction Using Class-Specific Sparse Filters

Parth Padalkar, Jaeseong Lee, Shiyi Wei, Gopal Gupta

PDF

Open Access

TL;DR

This paper introduces a novel training method with class-specific sparse filters that improves the accuracy and interpretability of neuro-symbolic rule extraction from CNNs, achieving state-of-the-art results.

Contribution

It proposes a new sparsity loss function for CNN training that preserves information during rule extraction, enhancing interpretability without sacrificing much accuracy.

Findings

01

Achieved 9% higher accuracy than previous methods.

02

Reduced rule-set size by 53%.

03

Within 3% of original CNN accuracy.

Abstract

There has been significant focus on creating neuro-symbolic models for interpretable image classification using Convolutional Neural Networks (CNNs). These methods aim to replace the CNN with a neuro-symbolic model consisting of the CNN, which is used as a feature extractor, and an interpretable rule-set extracted from the CNN itself. While these approaches provide interpretability through the extracted rule-set, they often compromise accuracy compared to the original CNN model. In this paper, we identify the root cause of this accuracy loss as the post-training binarization of filter activations to extract the rule-set. To address this, we propose a novel sparsity loss function that enables class-specific filter binarization during CNN training, thus minimizing information loss when extracting the rule-set. We evaluate several training strategies with our novel sparsity loss, analyzing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Fuzzy Logic and Control Systems

MethodsSparse Evolutionary Training · Focus