Modularity Trumps Invariance for Compositional Robustness
Ian Mason, Anirban Sarkar, Tomotake Sasaki, Xavier Boix

TL;DR
This paper demonstrates that modular neural network architectures outperform invariant models in achieving compositional robustness to combined corruptions in image classification tasks.
Contribution
It introduces a modular architecture that reflects data structure, showing improved compositional robustness over invariant approaches in corrupted image classification.
Findings
Modular architectures outperform invariant models in compositional robustness.
Invariance between elemental corruptions does not predict robustness to their compositions.
Marginal improvements are achieved by invariance-based contrastive loss methods.
Abstract
By default neural networks are not robust to changes in data distribution. This has been demonstrated with simple image corruptions, such as blurring or adding noise, degrading image classification performance. Many methods have been proposed to mitigate these issues but for the most part models are evaluated on single corruptions. In reality, visual space is compositional in nature, that is, that as well as robustness to elemental corruptions, robustness to compositions of corruptions is also needed. In this work we develop a compositional image classification task where, given a few elemental corruptions, models are asked to generalize to compositions of these corruptions. That is, to achieve compositional robustness. We experimentally compare empirical risk minimization with an invariance building pairwise contrastive loss and, counter to common intuitions in domain generalization,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
