A Simple and Effective Baseline for Attentional Generative Adversarial   Networks

Mingyu Jin; Chong Zhang; Qinkai Yu; Haochen Xue; Xiaobo Jin; Xi Yang

arXiv:2306.14708·cs.CV·July 7, 2023·2 cites

A Simple and Effective Baseline for Attentional Generative Adversarial Networks

Mingyu Jin, Chong Zhang, Qinkai Yu, Haochen Xue, Xiaobo Jin, Xi Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces SEAttnGAN, a simplified and more efficient text-to-image GAN that removes redundant structures and optimizes loss functions, maintaining performance while improving training speed and model size.

Contribution

The paper proposes a streamlined version of AttnGAN that reduces complexity and enhances efficiency without sacrificing image quality.

Findings

01

Model size is reduced significantly.

02

Training efficiency is improved.

03

Image quality remains high.

Abstract

Synthesising a text-to-image model of high-quality images by guiding the generative model through the Text description is an innovative and challenging task. In recent years, AttnGAN based on the Attention mechanism to guide GAN training has been proposed, SD-GAN, which adopts a self-distillation technique to improve the performance of the generator and the quality of image generation, and Stack-GAN++, which gradually improves the details and quality of the image by stacking multiple generators and discriminators. However, this series of improvements to GAN all have redundancy to a certain extent, which affects the generation performance and complexity to a certain extent. We use the popular simple and effective idea (1) to remove redundancy structure and improve the backbone network of AttnGAN. (2) to integrate and reconstruct multiple losses of DAMSM. Our improvements have…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jmyissb/SEAttnGAN
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image Processing and 3D Reconstruction