NeuroShield: A Neuro-Symbolic Framework for Adversarial Robustness

Ali Shafiee Sarvestani; Jason Schmidt; Arman Roohi

arXiv:2601.13162·cs.LG·January 21, 2026

NeuroShield: A Neuro-Symbolic Framework for Adversarial Robustness

Ali Shafiee Sarvestani, Jason Schmidt, Arman Roohi

PDF

Open Access

TL;DR

NeuroShield introduces a neuro-symbolic framework that enhances adversarial robustness and interpretability of neural networks by integrating symbolic rule supervision, demonstrating significant improvements over standard adversarial training on the GTSRB dataset.

Contribution

The paper proposes a novel neuro-symbolic framework, extbackslash DesignII, that incorporates symbolic rules into neural network training to improve robustness and explainability.

Findings

01

Achieves 18.1 ext{ extperthousand} and 17.35 ext{ extperthousand} robustness gains over adversarial baselines.

02

Outperforms transformer-based defenses with a ResNet18 backbone.

03

Maintains clean-sample accuracy while significantly improving adversarial robustness.

Abstract

Adversarial vulnerability and lack of interpretability are critical limitations of deep neural networks, especially in safety-sensitive settings such as autonomous driving. We introduce \DesignII, a neuro-symbolic framework that integrates symbolic rule supervision into neural networks to enhance both adversarial robustness and explainability. Domain knowledge is encoded as logical constraints over appearance attributes such as shape and color, and enforced through semantic and symbolic logic losses applied during training. Using the GTSRB dataset, we evaluate robustness against FGSM and PGD attacks at a standard $ℓ_{\infty}$ perturbation budget of $ε = 8/255$ . Relative to clean training, standard adversarial training provides modest improvements in robustness ( $\sim$ 10 percentage points). Conversely, our FGSM-Neuro-Symbolic and PGD-Neuro-Symbolic models achieve substantially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI