NeuroShield: A Neuro-Symbolic Framework for Adversarial Robustness
Ali Shafiee Sarvestani, Jason Schmidt, Arman Roohi

TL;DR
NeuroShield introduces a neuro-symbolic framework that enhances adversarial robustness and interpretability of neural networks by integrating symbolic rule supervision, demonstrating significant improvements over standard adversarial training on the GTSRB dataset.
Contribution
The paper proposes a novel neuro-symbolic framework, extbackslash DesignII, that incorporates symbolic rules into neural network training to improve robustness and explainability.
Findings
Achieves 18.1 ext{ extperthousand} and 17.35 ext{ extperthousand} robustness gains over adversarial baselines.
Outperforms transformer-based defenses with a ResNet18 backbone.
Maintains clean-sample accuracy while significantly improving adversarial robustness.
Abstract
Adversarial vulnerability and lack of interpretability are critical limitations of deep neural networks, especially in safety-sensitive settings such as autonomous driving. We introduce \DesignII, a neuro-symbolic framework that integrates symbolic rule supervision into neural networks to enhance both adversarial robustness and explainability. Domain knowledge is encoded as logical constraints over appearance attributes such as shape and color, and enforced through semantic and symbolic logic losses applied during training. Using the GTSRB dataset, we evaluate robustness against FGSM and PGD attacks at a standard perturbation budget of . Relative to clean training, standard adversarial training provides modest improvements in robustness (10 percentage points). Conversely, our FGSM-Neuro-Symbolic and PGD-Neuro-Symbolic models achieve substantially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
