Reject Illegal Inputs with Generative Classifier Derived from Any   Discriminative Classifier

Xin Wang

arXiv:2001.00483·cs.LG·January 3, 2020

Reject Illegal Inputs with Generative Classifier Derived from Any Discriminative Classifier

Xin Wang

PDF

Open Access

TL;DR

This paper introduces SDIM-logit, a method that enhances generative classifiers derived from discriminative classifiers to effectively reject illegal inputs like adversarial and out-of-distribution samples, improving detection performance.

Contribution

SDIM-logit modifies SDIM by leveraging logits from discriminative classifiers, inheriting their performance with minimal additional parameters and enabling efficient rejection of illegal inputs.

Findings

01

Significantly improves illegal input detection when rejecting uncertain samples.

02

Efficiently inherits discriminative classifier performance without retraining from scratch.

03

Effective against adversarial, corrupted, and out-of-distribution samples.

Abstract

Generative classifiers have been shown promising to detect illegal inputs including adversarial examples and out-of-distribution samples. Supervised Deep Infomax~(SDIM) is a scalable end-to-end framework to learn generative classifiers. In this paper, we propose a modification of SDIM termed SDIM-\emph{logit}. Instead of training generative classifier from scratch, SDIM-\emph{logit} first takes as input the logits produced any given discriminative classifier, and generate logit representations; then a generative classifier is derived by imposing statistical constraints on logit representations. SDIM-\emph{logit} could inherit the performance of the discriminative classifier without loss. SDIM-\emph{logit} incurs a negligible number of additional parameters, and can be efficiently trained with base classifiers fixed. We perform \emph{classification with rejection}, where test samples…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis

MethodsTest