A causal view of compositional zero-shot recognition

Yuval Atzmon; Felix Kreuk; Uri Shalit; Gal Chechik

arXiv:2006.14610·cs.CV·November 3, 2020·59 cites

A causal view of compositional zero-shot recognition

Yuval Atzmon, Felix Kreuk, Uri Shalit, Gal Chechik

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a causal approach to compositional zero-shot recognition, enabling models to better generalize to new attribute-object combinations by learning disentangled, causally-inspired representations.

Contribution

It proposes a causal perspective for zero-shot learning and develops a causal-inspired embedding model that improves compositional generalization.

Findings

01

Improved accuracy on synthesized attribute-object datasets

02

Enhanced generalization to unseen attribute-object pairs

03

Causal-inspired embeddings outperform baselines

Abstract

People easily recognize new visual categories that are new combinations of known components. This compositional generalization capacity is critical for learning in real-world domains like vision and language because the long tail of new combinations dominates the distribution. Unfortunately, learning systems struggle with compositional generalization because they often build on features that are correlated with class labels even if they are not "essential" for the class. This leads to consistent misclassification of samples from a new distribution, like new combinations of known components. Here we describe an approach for compositional generalization that builds on causal ideas. First, we describe compositional zero-shot learning from a causal perspective, and propose to view zero-shot inference as finding "which intervention caused the image?". Second, we present a causal-inspired…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nv-research-israel/causal_comp
pytorchOfficial

Videos

A causal view of compositional zero-shot recognition· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · COVID-19 diagnosis using AI