Entropy-Based Non-Invasive Reliability Monitoring of Convolutional Neural Networks

Amirhossein Nazeri; Wael Hafez

arXiv:2508.21715·cs.CV·September 1, 2025

Entropy-Based Non-Invasive Reliability Monitoring of Convolutional Neural Networks

Amirhossein Nazeri, Wael Hafez

PDF

Open Access

TL;DR

This paper introduces a method to detect adversarial attacks on CNNs by monitoring entropy changes in activations, enabling real-time, non-invasive reliability assessment without retraining or architecture changes.

Contribution

It demonstrates that adversarial inputs cause measurable entropy shifts in CNN activations, allowing effective detection without modifying the model.

Findings

01

Adversarial inputs increase activation entropy by 7% in early layers.

02

Detection accuracy of 90% with low false positive/negative rates.

03

Entropy distributions for clean and adversarial inputs are well separated.

Abstract

Convolutional Neural Networks (CNNs) have become the foundation of modern computer vision, achieving unprecedented accuracy across diverse image recognition tasks. While these networks excel on in-distribution data, they remain vulnerable to adversarial perturbations imperceptible input modifications that cause misclassification with high confidence. However, existing detection methods either require expensive retraining, modify network architecture, or degrade performance on clean inputs. Here we show that adversarial perturbations create immediate, detectable entropy signatures in CNN activations that can be monitored without any model modification. Using parallel entropy monitoring on VGG-16, we demonstrate that adversarial inputs consistently shift activation entropy by 7% in early convolutional layers, enabling 90% detection accuracy with false positives and false negative rates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Explainable Artificial Intelligence (XAI)