Limitations of Post-Hoc Feature Alignment for Robustness

Collin Burns; Jacob Steinhardt

arXiv:2103.05898·cs.CV·March 11, 2021

Limitations of Post-Hoc Feature Alignment for Robustness

Collin Burns, Jacob Steinhardt

PDF

1 Repo

TL;DR

This paper critically examines the effectiveness of post-hoc feature alignment, specifically batch normalization statistics matching, revealing its limited benefits and potential drawbacks for robustness against distribution shifts.

Contribution

The study provides a detailed analysis of the limitations of feature alignment methods, especially batch normalization alignment, and explains why they may not reliably improve robustness.

Findings

01

Only helps with specific distribution shifts

02

Can degrade performance in some settings

03

Challenges the utility of feature alignment for robustness

Abstract

Feature alignment is an approach to improving robustness to distribution shift that matches the distribution of feature activations between the training distribution and test distribution. A particularly simple but effective approach to feature alignment involves aligning the batch normalization statistics between the two distributions in a trained neural network. This technique has received renewed interest lately because of its impressive performance on robustness benchmarks. However, when and why this method works is not well understood. We investigate the approach in more detail and identify several limitations. We show that it only significantly helps with a narrow set of distribution shifts and we identify several settings in which it even degrades performance. We also explain why these limitations arise by pinpointing why this approach can be so effective in the first place. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

collin-burns/feature-alignment
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsBatch Normalization