Robust Adversarial Quantification via Conflict-Aware Evidential Deep Learning

Charmaine Barker; Daniel Bethell; Simos Gerasimou

arXiv:2506.05937·cs.LG·March 5, 2026

Robust Adversarial Quantification via Conflict-Aware Evidential Deep Learning

Charmaine Barker, Daniel Bethell, Simos Gerasimou

PDF

Open Access 3 Reviews

TL;DR

This paper introduces C-EDL, a lightweight post-hoc method that enhances the robustness of evidential deep learning models against adversarial and out-of-distribution inputs by leveraging conflict-aware uncertainty calibration.

Contribution

C-EDL is a novel, post-hoc approach that improves adversarial and OOD robustness of EDL models without retraining, using conflict-aware transformations and disagreement quantification.

Findings

01

C-EDL significantly reduces coverage for OOD data by up to 55%.

02

C-EDL achieves up to 90% reduction in coverage for adversarial data.

03

C-EDL maintains high in-distribution accuracy with low computational overhead.

Abstract

Reliability of deep learning models is critical for deployment in high-stakes applications, where out-of-distribution or adversarial inputs may lead to detrimental outcomes. Evidential Deep Learning, an efficient paradigm for uncertainty quantification, models predictions as Dirichlet distributions of a single forward pass. However, EDL is particularly vulnerable to adversarially perturbed inputs, making overconfident errors. Conflict-aware Evidential Deep Learning (C-EDL) is a lightweight post-hoc uncertainty quantification approach that mitigates these issues, enhancing adversarial and OOD robustness without retraining. C-EDL generates diverse, task-preserving transformations per input and quantifies representational disagreement to calibrate uncertainty estimates when needed. C-EDL's conflict-aware prediction adjustment improves detection of OOD and adversarial inputs, maintaining…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 6Confidence 4

Strengths

Originality: The paper introduces a conflict-aware uncertainty adjustment mechanism for evidential deep learning (EDL), which is a meaningful extension to prior post-hoc calibration and OOD robustness methods. The idea of leveraging metamorphic transformations to measure evidence disagreement is both conceptually intuitive and novel within the EDL literature. • Quality: The experimental evaluation is comprehensive, spanning multiple datasets (MNIST, CIFAR10, Flowers102, etc.) and adversarial se

Weaknesses

Limited Theoretical Depth: While the paper provides an intuitive justification and basic boundedness proof for the conflict metric, the mathematical grounding of how conflict relates to epistemic and aleatoric uncertainty decomposition is underdeveloped. A stronger theoretical connection to Dempster–Shafer theory or Bayesian evidence accumulation would strengthen the claim of principled robustness. • Overlap with Prior Work: The contribution, though practical, feels incremental relative to rece

Reviewer 02Rating 6Confidence 3

Strengths

1. C-EDL is simple and clear, which achieves promising results. 2. The conflict measure $C$ is formally bounded, providing a principled basis for evidence adjustment. 3. This work provides a comprehensive evaluation.

Weaknesses

1. Standard EDL was originally used for the most basic classification tasks. Can the proposed C-EDL be directly applied to improve the level of uncertainty quantification and performance? 2. The citation format needs to be modified to improve readability; for example, using \citep. 3. C-EDL essentially utilizes multiple views and improves the accuracy of uncertainty quantification by quantifying the conflict of these views. However, there is a lack of discussion regarding some related work, e.g.

Reviewer 03Rating 2Confidence 4

Strengths

1. The proposed method is post-hoc and architecture-agnostic. It can be directly applied to any trained EDL model without modifying the network structure or retraining, making it highly practical and easy to integrate. 2. The approach is computationally efficient compared to full retraining or Bayesian ensemble methods, as it only involves a few forward passes with simple input transformations.

Weaknesses

1. The paper lacks sufficient implementation details for reproducibility. For example, the specific metamorphic transformations applied to each dataset, the exact uncertainty metric used in Table 1, and the threshold definition for coverage reporting are not clearly specified. The choice and tuning process for hyperparameters (e.g., conflict weighting and decay factors) should also be explained. 2. The motivation requires further clarification. EDL naturally distinguishes between aleatoric and e

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Explainable Artificial Intelligence (XAI)