Learning Attention Propagation for Compositional Zero-Shot Learning
Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Alain, Pagani, Didier Stricker, Muhammad Zeshan Afzal

TL;DR
This paper introduces CAPE, a novel attention-based method that models complex dependencies between compositions of visual primitives, significantly improving zero-shot recognition of unseen object-state combinations.
Contribution
The paper proposes CAPE, a new approach that captures rich dependency structures among compositions, enabling better generalization to unseen combinations in compositional zero-shot learning.
Findings
Outperforms previous methods on three benchmarks.
Sets new state-of-the-art in generalized compositional zero-shot learning.
Effectively models complex interactions between primitives.
Abstract
Compositional zero-shot learning aims to recognize unseen compositions of seen visual primitives of object classes and their states. While all primitives (states and objects) are observable during training in some combination, their complex interaction makes this task especially hard. For example, wet changes the visual appearance of a dog very differently from a bicycle. Furthermore, we argue that relationships between compositions go beyond shared states or objects. A cluttered office can contain a busy table; even though these compositions don't share a state or object, the presence of a busy table can guide the presence of a cluttered office. We propose a novel method called Compositional Attention Propagated Embedding (CAPE) as a solution. The key intuition to our method is that a rich dependency structure exists between compositions arising from complex interactions of primitives…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Learning Attention Propagation for Compositional Zero-Shot Learning· youtube
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Multimodal Machine Learning Applications
