Efficient generative adversarial networks using linear additive-attention Transformers

Emilio Morales-Juarez; Gibran Fuentes-Pineda

arXiv:2401.09596 · cs.CV · July 8, 2025 · 2 cites

TL;DR

LadaGAN introduces a linear attention Transformer-based GAN architecture that significantly reduces computational costs while outperforming existing models on benchmarks, making high-quality image generation more accessible.

Contribution

The paper presents LadaGAN, a novel GAN architecture with linear additive-attention Transformers that improves efficiency and stability over traditional and Transformer-based GANs.

Findings

1. LadaGAN outperforms existing convolutional and Transformer GANs on benchmarks.
2. LadaGAN is significantly more computationally efficient.
3. LadaGAN achieves competitive results with fewer resources.

Abstract

Although the capacity of deep generative models for image generation, such as Diffusion Models (DMs) and Generative Adversarial Networks (GANs), has dramatically improved in recent years, much of their success can be attributed to computationally expensive architectures. This has limited their adoption and use to research laboratories and companies with large resources, while significantly raising the carbon footprint for training, fine-tuning, and inference. In this work, we present a novel GAN architecture which we call LadaGAN. This architecture is based on a linear attention Transformer block named Ladaformer. The main component of this block is a linear additive-attention mechanism that computes a single attention vector per head instead of the quadratic dot-product attention. We employ Ladaformer in both the generator and discriminator, which reduces the computational complexity…
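
The contrast the abstract draws between a single attention vector per head and quadratic dot-product attention can be illustrated with a short sketch. The code below is a minimal NumPy example, assuming a Fastformer-style additive attention; the function name `additive_attention`, the weight names, and the way the global query is combined with keys and values are illustrative assumptions, not the authors' Ladaformer implementation, which may differ in its details.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def additive_attention(x, wq, wk, wv, w_score, num_heads):
    """Sketch of linear additive attention (assumed formulation).

    x:        (n, d) token embeddings
    wq/wk/wv: (d, d) projection matrices (hypothetical names)
    w_score:  (num_heads, d // num_heads) learned scoring vectors
    Returns the (n, d) output and the (num_heads, n) attention vectors.
    """
    n, d = x.shape
    d_head = d // num_heads

    def split_heads(t):
        # (n, d) -> (num_heads, n, d_head)
        return t.reshape(n, num_heads, d_head).transpose(1, 0, 2)

    q = split_heads(x @ wq)
    k = split_heads(x @ wk)
    v = split_heads(x @ wv)

    # One scalar score per token and head: an n-vector per head,
    # not an n x n matrix as in dot-product attention.
    scores = np.einsum("hnd,hd->hn", q, w_score) / np.sqrt(d_head)
    attn = softmax(scores, axis=-1)

    # Pool the queries into a single global query vector per head.
    global_q = np.einsum("hn,hnd->hd", attn, q)

    # Assumed combination step: broadcast the global query over the
    # tokens and mix it element-wise with keys and values.
    out = (global_q[:, None, :] * k) * v

    # Merge heads back: (num_heads, n, d_head) -> (n, d)
    return out.transpose(1, 0, 2).reshape(n, d), attn
```

Every step here touches each token a constant number of times, so time and memory grow linearly with the number of tokens `n`, whereas dot-product attention has to materialize an `n × n` score matrix per head.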

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis

Methods: Multi-Head Attention · Attention Is All You Need · Label Smoothing · Absolute Position Encodings · Layer Normalization · Dropout · Linear Layer · Byte Pair Encoding · Softmax · Adam