A causal view of compositional zero-shot recognition
Yuval Atzmon, Felix Kreuk, Uri Shalit, Gal Chechik

TL;DR
This paper introduces a causal approach to compositional zero-shot recognition, enabling models to better generalize to new attribute-object combinations by learning disentangled, causally-inspired representations.
Contribution
It proposes a causal perspective for zero-shot learning and develops a causal-inspired embedding model that improves compositional generalization.
Findings
Improved accuracy on synthesized attribute-object datasets
Enhanced generalization to unseen attribute-object pairs
Causal-inspired embeddings outperform baselines
Abstract
People easily recognize new visual categories that are new combinations of known components. This compositional generalization capacity is critical for learning in real-world domains like vision and language because the long tail of new combinations dominates the distribution. Unfortunately, learning systems struggle with compositional generalization because they often build on features that are correlated with class labels even if they are not "essential" for the class. This leads to consistent misclassification of samples from a new distribution, like new combinations of known components. Here we describe an approach for compositional generalization that builds on causal ideas. First, we describe compositional zero-shot learning from a causal perspective, and propose to view zero-shot inference as finding "which intervention caused the image?". Second, we present a causal-inspired…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · COVID-19 diagnosis using AI
