Enhancing Adversarial Robustness with Conformal Prediction: A Framework for Guaranteed Model Reliability

Jie Bao; Chuangyin Dang; Rui Luo; Hanwei Zhang; Zhixin Zhou

arXiv:2506.07804·cs.LG·June 10, 2025

Enhancing Adversarial Robustness with Conformal Prediction: A Framework for Guaranteed Model Reliability

Jie Bao, Chuangyin Dang, Rui Luo, Hanwei Zhang, Zhixin Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel framework combining conformal prediction with adversarial training to improve the robustness and reliability of deep learning models against adversarial attacks, especially in safety-critical applications.

Contribution

It develops OPSA, an adversarial attack method targeting conformal prediction, and OPSA-AT, a new adversarial training strategy that enhances model robustness and uncertainty estimation.

Findings

01

OPSA induces greater uncertainty than baseline attacks.

02

OPSA-AT improves robustness against multiple adversarial attacks.

03

The integrated approach provides reliable predictions in high-risk scenarios.

Abstract

As deep learning models are increasingly deployed in high-risk applications, robust defenses against adversarial attacks and reliable performance guarantees become paramount. Moreover, accuracy alone does not provide sufficient assurance or reliable uncertainty estimates for these models. This study advances adversarial training by leveraging principles from Conformal Prediction. Specifically, we develop an adversarial attack method, termed OPSA (OPtimal Size Attack), designed to reduce the efficiency of conformal prediction at any significance level by maximizing model uncertainty without requiring coverage guarantees. Correspondingly, we introduce OPSA-AT (Adversarial Training), a defense strategy that integrates OPSA within a novel conformal training paradigm. Experimental evaluations demonstrate that our OPSA attack method induces greater uncertainty compared to baseline approaches…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bjbbbb/enhancing-adversarial-robustness-with-conformal-prediction
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI