Efficient generative adversarial networks using linear additive-attention Transformers
Emilio Morales-Juarez, Gibran Fuentes-Pineda

TL;DR
LadaGAN introduces a linear attention Transformer-based GAN architecture that significantly reduces computational costs while outperforming existing models on benchmarks, making high-quality image generation more accessible.
Contribution
The paper presents LadaGAN, a novel GAN architecture with linear additive-attention Transformers that improves efficiency and stability over traditional and Transformer-based GANs.
Findings
LadaGAN outperforms existing convolutional and Transformer GANs on benchmarks.
LadaGAN is significantly more computationally efficient.
LadaGAN achieves competitive results with less resources.
Abstract
Although the capacity of deep generative models for image generation, such as Diffusion Models (DMs) and Generative Adversarial Networks (GANs), has dramatically improved in recent years, much of their success can be attributed to computationally expensive architectures. This has limited their adoption and use to research laboratories and companies with large resources, while significantly raising the carbon footprint for training, fine-tuning, and inference. In this work, we present a novel GAN architecture which we call LadaGAN. This architecture is based on a linear attention Transformer block named Ladaformer. The main component of this block is a linear additive-attention mechanism that computes a single attention vector per head instead of the quadratic dot-product attention. We employ Ladaformer in both the generator and discriminator, which reduces the computational complexity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis
MethodsMulti-Head Attention · Attention Is All You Need · Label Smoothing · Absolute Position Encodings · Layer Normalization · Dropout · Linear Layer · Byte Pair Encoding · Softmax · Adam
