Attributes as Operators: Factorizing Unseen Attribute-Object Compositions
Tushar Nagarajan, Kristen Grauman

TL;DR
This paper introduces a novel method modeling visual attributes as operators, enabling better recognition of unseen attribute-object combinations by explicitly separating attributes from objects in a semantic embedding.
Contribution
The paper proposes modeling attributes as operators within a semantic space, improving generalization to unseen attribute-object compositions and incorporating attribute effects with new regularizers.
Findings
Significant improvements over state-of-the-art on two datasets
Robust recognition of unseen attribute-object compositions
Generalization to unseen objects during training
Abstract
We present a new approach to modeling visual attributes. Prior work casts attributes in a similar role as objects, learning a latent representation where properties (e.g., sliced) are recognized by classifiers much in the way objects (e.g., apple) are. However, this common approach fails to separate the attributes observed during training from the objects with which they are composed, making it ineffectual when encountering new attribute-object compositions. Instead, we propose to model attributes as operators. Our approach learns a semantic embedding that explicitly factors out attributes from their accompanying objects, and also benefits from novel regularizers expressing attribute operators' effects (e.g., blunt should undo the effects of sharp). Not only does our approach align conceptually with the linguistic role of attributes as modifiers, but it also generalizes to recognize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques
