Each Attribute Matters: Contrastive Attention for Sentence-based Image   Editing

Liuqing Zhao; Fan Lyu; Fuyuan Hu; Kaizhu Huang; Fenglei Xu; Linyan Li

arXiv:2110.11159·cs.CV·October 22, 2021

Each Attribute Matters: Contrastive Attention for Sentence-based Image Editing

Liuqing Zhao, Fan Lyu, Fuyuan Hu, Kaizhu Huang, Fenglei Xu, Linyan Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces CA-GAN, a contrastive attention model for sentence-based image editing that improves accuracy and attribute-specific editing, especially with multiple attributes, demonstrated on CUB and COCO datasets.

Contribution

The paper proposes a novel contrastive attention module and attribute discriminator to enhance attribute-specific editing in sentence-based image editing.

Findings

01

Effective attribute editing on CUB and COCO datasets

02

Improved accuracy in multi-attribute sentence-based image editing

03

Encouraging qualitative results demonstrating the method's effectiveness

Abstract

Sentence-based Image Editing (SIE) aims to deploy natural language to edit an image. Offering potentials to reduce expensive manual editing, SIE has attracted much interest recently. However, existing methods can hardly produce accurate editing and even lead to failures in attribute editing when the query sentence is with multiple editable attributes. To cope with this problem, by focusing on enhancing the difference between attributes, this paper proposes a novel model called Contrastive Attention Generative Adversarial Network (CA-GAN), which is inspired from contrastive training. Specifically, we first design a novel contrastive attention module to enlarge the editing difference between random combinations of attributes which are formed during training. We then construct an attribute discriminator to ensure effective editing on each attribute. A series of experiments show that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zlq2021/ca-gan
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning