The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks
Ziquan Liu, Yufei Cui, Yan Yan, Yi Xu, Xiangyang Ji, Xue Liu, Antoni, B. Chan

TL;DR
This paper examines the effectiveness of conformal prediction in providing reliable uncertainty quantification under adversarial attacks, revealing limitations with standard methods and proposing an improved adversarial training approach for better uncertainty estimates.
Contribution
It uncovers the limitations of existing conformal prediction methods under adversarial attacks and introduces an entropy-regularized adversarial training technique to enhance uncertainty quantification.
Findings
CP fails to produce informative sets without adversarial training.
Adversarial training improves CP performance but can increase prediction set size.
Proposed AT-UR method reduces uncertainty and improves prediction set efficiency.
Abstract
In safety-critical applications such as medical imaging and autonomous driving, where decisions have profound implications for patient health and road safety, it is imperative to maintain both high adversarial robustness to protect against potential adversarial attacks and reliable uncertainty quantification in decision-making. With extensive research focused on enhancing adversarial robustness through various forms of adversarial training (AT), a notable knowledge gap remains concerning the uncertainty inherent in adversarially trained models. To address this gap, this study investigates the uncertainty of deep learning models by examining the performance of conformal prediction (CP) in the context of standard adversarial attacks within the adversarial defense community. It is first unveiled that existing CP methods do not produce informative prediction sets under the commonly used…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security
MethodsSparse Evolutionary Training
