Generalized Likelihood Ratio Test for Adversarially Robust Hypothesis Testing
Bhagyashree Puranik, Upamanyu Madhow, Ramtin Pedarsani

TL;DR
This paper introduces a defense method against adversarial attacks in hypothesis testing using the generalized likelihood ratio test (GLRT), demonstrating its effectiveness and robustness in various scenarios, including unknown attack models.
Contribution
The paper develops a GLRT-based defense for adversarial hypothesis testing, extending its application to multi-class problems and analyzing its performance against worst-case and adaptive attacks.
Findings
GLRT approaches minimax defense asymptotically in high dimensions
GLRT offers better robustness-accuracy tradeoff under weaker attacks
Effective in multi-class hypothesis testing with unknown attack models
Abstract
Machine learning models are known to be susceptible to adversarial attacks which can cause misclassification by introducing small but well designed perturbations. In this paper, we consider a classical hypothesis testing problem in order to develop fundamental insight into defending against such adversarial perturbations. We interpret an adversarial perturbation as a nuisance parameter, and propose a defense based on applying the generalized likelihood ratio test (GLRT) to the resulting composite hypothesis testing problem, jointly estimating the class of interest and the adversarial perturbation. While the GLRT approach is applicable to general multi-class hypothesis testing, we first evaluate it for binary hypothesis testing in white Gaussian noise under norm-bounded adversarial perturbations, for which a known minimax defense optimizing for the worst-case attack…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
