A Neurosymbolic Framework for Bias Correction in Convolutional Neural Networks
Parth Padalkar, Natalia \'Slusarz, Ekaterina Komendantskaya, Gopal, Gupta

TL;DR
This paper presents NeSyBiCor, a neurosymbolic framework that corrects biases in CNNs by integrating symbolic concepts and retraining with a semantic similarity loss, enhancing interpretability with minimal accuracy loss.
Contribution
The paper introduces a novel neurosymbolic approach for bias correction in CNNs using ASP constraints and a semantic similarity loss during retraining.
Findings
Successfully corrects biases in CNNs trained on subset classes from Places dataset.
Maintains high accuracy while improving interpretability.
Demonstrates effectiveness of the framework in bias correction.
Abstract
Recent efforts in interpreting Convolutional Neural Networks (CNNs) focus on translating the activation of CNN filters into a stratified Answer Set Program (ASP) rule-sets. The CNN filters are known to capture high-level image concepts, thus the predicates in the rule-set are mapped to the concept that their corresponding filter represents. Hence, the rule-set exemplifies the decision-making process of the CNN w.r.t the concepts that it learns for any image classification task. These rule-sets help understand the biases in CNNs, although correcting the biases remains a challenge. We introduce a neurosymbolic framework called NeSyBiCor for bias correction in a trained CNN. Given symbolic concepts, as ASP constraints, that the CNN is biased towards, we convert the concepts to their corresponding vector representations. Then, the CNN is retrained using our novel semantic similarity loss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques
MethodsSparse Evolutionary Training · Focus
