Amber-Image: Efficient Compression of Large-Scale Diffusion Transformers

Chaojie Yang; Tian Li; Yue Zhang; Jun Gao

arXiv:2602.17047·cs.CV·February 20, 2026

Amber-Image: Efficient Compression of Large-Scale Diffusion Transformers

Chaojie Yang, Tian Li, Yue Zhang, Jun Gao

PDF

Open Access 1 Models

TL;DR

This paper introduces Amber-Image, a highly efficient compression framework for large diffusion transformers, enabling lightweight T2I models with minimal training and computational costs while maintaining high quality.

Contribution

The paper presents a novel compression method that reduces model size by 70% and enables cost-effective training of high-performance diffusion models without large-scale data engineering.

Findings

01

Amber-Image-10B uses depth pruning and distillation to compress models.

02

Amber-Image-6B employs a hybrid architecture with progressive distillation.

03

Models achieve high-fidelity synthesis comparable to larger models.

Abstract

Diffusion Transformer (DiT) architectures have significantly advanced Text-to-Image (T2I) generation but suffer from prohibitive computational costs and deployment barriers. To address these challenges, we propose an efficient compression framework that transforms the 60-layer dual-stream MMDiT-based Qwen-Image into lightweight models without training from scratch. Leveraging this framework, we introduce Amber-Image, a series of streamlined T2I models. We first derive Amber-Image-10B using a timestep-sensitive depth pruning strategy, where retained layers are reinitialized via local weight averaging and optimized through layer-wise distillation and full-parameter fine-tuning. Building on this, we develop Amber-Image-6B by introducing a hybrid-stream architecture that converts deep-layer dual streams into a single stream initialized from the image branch, further refined via progressive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
hellogroup-opensource/AMBER-IMAGE
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Image Enhancement Techniques · Generative Adversarial Networks and Image Synthesis