FA-GAN: Feature-Aware GAN for Text to Image Synthesis

Eunyeong Jeon; Kunhee Kim; Daijin Kim

arXiv:2109.00907·cs.CV·September 3, 2021

FA-GAN: Feature-Aware GAN for Text to Image Synthesis

Eunyeong Jeon, Kunhee Kim, Daijin Kim

PDF

Open Access

TL;DR

FA-GAN introduces a feature-aware GAN framework with a self-supervised discriminator and feature loss, significantly improving the quality of text-to-image synthesis by producing clearer textures and intact objects.

Contribution

The paper proposes a novel FA-GAN model that integrates a self-supervised discriminator and feature-aware loss to enhance image quality in text-to-image synthesis.

Findings

01

Achieved a lower FID score of 24.58 on MS-COCO, indicating higher image quality.

02

Outperformed previous state-of-the-art methods in text-to-image synthesis.

03

Demonstrated improved object integrity and texture clarity in generated images.

Abstract

Text-to-image synthesis aims to generate a photo-realistic image from a given natural language description. Previous works have made significant progress with Generative Adversarial Networks (GANs). Nonetheless, it is still hard to generate intact objects or clear textures (Fig 1). To address this issue, we propose Feature-Aware Generative Adversarial Network (FA-GAN) to synthesize a high-quality image by integrating two techniques: a self-supervised discriminator and a feature-aware loss. First, we design a self-supervised discriminator with an auxiliary decoder so that the discriminator can extract better representation. Secondly, we introduce a feature-aware loss to provide the generator more direct supervision by employing the feature representation from the self-supervised discriminator. Experiments on the MS-COCO dataset show that our proposed method significantly advances the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Advanced Image and Video Retrieval Techniques