Volatility in Certainty (VC): A Metric for Detecting Adversarial Perturbations During Inference in Neural Network Classifiers

Vahid Hemmati; Ahmad Mohammadi; Abdul-Rauf Nuhu; Reza Ahmari; Parham Kebria; Abdollah Homaifar

arXiv:2511.11834·cs.LG·November 18, 2025

Volatility in Certainty (VC): A Metric for Detecting Adversarial Perturbations During Inference in Neural Network Classifiers

Vahid Hemmati, Ahmad Mohammadi, Abdul-Rauf Nuhu, Reza Ahmari, Parham Kebria, Abdollah Homaifar

PDF

Open Access

TL;DR

This paper introduces Volatility in Certainty (VC), a label-free metric that detects adversarial perturbations by measuring confidence fluctuations, effectively predicting model performance degradation in real-time without needing ground-truth labels.

Contribution

The paper proposes VC as a novel, architecture-agnostic metric for real-time detection of adversarial perturbations and distribution shifts during inference.

Findings

01

VC strongly correlates with accuracy degradation (rho < -0.90).

02

VC effectively detects adversarial and distribution shifts.

03

VC operates without labeled data, enabling real-time monitoring.

Abstract

Adversarial robustness remains a critical challenge in deploying neural network classifiers, particularly in real-time systems where ground-truth labels are unavailable during inference. This paper investigates \textit{Volatility in Certainty} (VC), a recently proposed, label-free metric that quantifies irregularities in model confidence by measuring the dispersion of sorted softmax outputs. Specifically, VC is defined as the average squared log-ratio of adjacent certainty values, capturing local fluctuations in model output smoothness. We evaluate VC as a proxy for classification accuracy and as an indicator of adversarial drift. Experiments are conducted on artificial neural networks (ANNs) and convolutional neural networks (CNNs) trained on MNIST, as well as a regularized VGG-like model trained on CIFAR-10. Adversarial examples are generated using the Fast Gradient Sign Method (FGSM)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications