Do Deep Networks Transfer Invariances Across Classes?
Allan Zhou, Fahim Tajwar, Alexander Robey, Tom Knowles, George J., Pappas, Hamed Hassani, Chelsea Finn

TL;DR
This paper investigates how neural networks transfer class-agnostic invariances across classes, revealing limitations on small classes and proposing a generative approach to improve invariance transfer and classification performance in imbalanced datasets.
Contribution
The paper demonstrates the poor transfer of invariances from large to small classes and introduces a generative method to enhance invariance transfer in imbalanced image classification.
Findings
Invariance transfer is limited on small classes despite data balancing.
Neural networks' invariance to class-agnostic transformations depends heavily on class size.
Generative approaches can improve invariance transfer and classification accuracy.
Abstract
To generalize well, classifiers must learn to be invariant to nuisance transformations that do not alter an input's class. Many problems have "class-agnostic" nuisance transformations that apply similarly to all classes, such as lighting and background changes for image classification. Neural networks can learn these invariances given sufficient data, but many real-world datasets are heavily class imbalanced and contain only a few examples for most of the classes. We therefore pose the question: how well do neural networks transfer class-agnostic invariances learned from the large classes to the small ones? Through careful experimentation, we observe that invariance to class-agnostic transformations is still heavily dependent on class size, with the networks being much less invariant on smaller classes. This result holds even when using data balancing techniques, and suggests poor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Imbalanced Data Classification Techniques
