Unsupervised Adversarial Detection without Extra Model: Training Loss   Should Change

Chien Cheng Chyou; Hung-Ting Su; Winston H. Hsu

arXiv:2308.03243·cs.LG·August 8, 2023

Unsupervised Adversarial Detection without Extra Model: Training Loss Should Change

Chien Cheng Chyou, Hung-Ting Su, Winston H. Hsu

PDF

Open Access 1 Repo

TL;DR

This paper introduces new training losses that improve unsupervised adversarial detection accuracy without extra models, effectively reducing false positives and maintaining high detection rates across various attack types.

Contribution

It proposes novel training losses that diminish useless features and enable effective unsupervised adversarial detection without prior attack knowledge.

Findings

01

Detection rate above 93.9% for most attacks

02

False positive rate around 2.5%

03

Effective across diverse attack types

Abstract

Adversarial robustness poses a critical challenge in the deployment of deep learning models for real-world applications. Traditional approaches to adversarial training and supervised detection rely on prior knowledge of attack types and access to labeled training data, which is often impractical. Existing unsupervised adversarial detection methods identify whether the target model works properly, but they suffer from bad accuracies owing to the use of common cross-entropy training loss, which relies on unnecessary features and strengthens adversarial attacks. We propose new training losses to reduce useless features and the corresponding detection method without prior knowledge of adversarial attacks. The detection rate (true positive rate) against all given white-box attacks is above 93.9% except for attacks without limits (DF( $\infty$ )), while the false positive rate is barely 2.5%.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cyclebooster/unsupervised-adversarial-detection-without-extra-model
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications