MEMO: Test Time Robustness via Adaptation and Augmentation
Marvin Zhang, Sergey Levine, Chelsea Finn

TL;DR
This paper introduces a simple test time adaptation method that improves neural network robustness against distribution shifts by augmenting data and minimizing entropy of the model's output, without requiring assumptions about training.
Contribution
The authors propose a broadly applicable test time adaptation technique that enhances robustness by using data augmentation and entropy minimization, requiring no prior training assumptions.
Findings
Achieves 1-8% accuracy improvement over standard evaluation.
Outperforms prior strategies on robustness benchmarks.
Sets state-of-the-art results on ImageNet-C, ImageNet-R, and ImageNet-A.
Abstract
While deep neural networks can attain good accuracy on in-distribution test points, many applications require robustness even in the face of unexpected perturbations in the input, changes in the domain, or other sources of distribution shift. We study the problem of test time robustification, i.e., using the test input to improve model robustness. Recent prior works have proposed methods for test time adaptation, however, they each introduce additional assumptions, such as access to multiple test points, that prevent widespread adoption. In this work, we aim to study and devise methods that make no assumptions about the model training process and are broadly applicable at test time. We propose a simple approach that can be used in any test setting where the model is probabilistic and adaptable: when presented with a test example, perform different data augmentations on the data point,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsMulti-Head Attention · Attention Is All You Need · Test · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · 1x1 Convolution · Softmax · Batch Normalization · Residual Connection · Convolution
