DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines

Ye Tian; Zhen Jia; Ziyue Luo; Yida Wang; Chuan Wu

arXiv:2405.01248 · cs.DC · May 3, 2024 · 1 citation

TL;DR

DiffusionPipe is a synchronous pipeline parallel training system for large diffusion models. It combines optimized stage partitioning with pipeline bubble filling, reporting up to 1.41x speedup over pipeline parallel methods and up to 1.28x over data parallel training.

Contribution

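It proposes a pipeline training system for large diffusion models that selects stage partitioning and pipeline scheduling with a dynamic programming approach and fills pipeline bubbles with the computation of non-trainable model parts using a greedy algorithm, raising training throughput.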

Findings

01. Achieves up to 1.41x speedup over pipeline parallel methods.

02. Achieves up to 1.28x speedup over data parallel training.

03. Integrates non-trainable parts (e.g., frozen input encoders) into pipeline training by running them in the pipeline's idle periods.

Abstract

Diffusion models have emerged as dominant performers for image generation. To support training large diffusion models, this paper studies pipeline parallel training of diffusion models and proposes DiffusionPipe, a synchronous pipeline training system that advocates an innovative pipeline bubble filling technique catering to the structural characteristics of diffusion models. State-of-the-art diffusion models typically include trainable (the backbone) and non-trainable (e.g., frozen input encoders) parts. We first unify optimal stage partitioning and pipeline scheduling of single and multiple backbones in representative diffusion models with a dynamic programming approach. We then propose to schedule the computation of non-trainable model parts into idle periods (bubbles) of the backbones' pipeline training using an efficient greedy algorithm, thus achieving high training throughput. Extensive…
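
The two ideas named in the abstract, dynamic programming for stage partitioning and greedy bubble filling, can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a simplified cost model in which each backbone layer has a single known time, the partitioning goal is to minimize the slowest stage, each bubble is a single idle duration, and non-trainable tasks have no dependencies on one another. The function names, the task labels (text_encoder, vae_encoder, extra), and all timings are hypothetical.

```python
def partition_stages(layer_times, num_stages):
    """Split consecutive layers into contiguous stages, minimizing the
    slowest stage (a classic linear-partition DP, used here as a stand-in
    for the paper's partitioning objective, which is an assumption)."""
    n = len(layer_times)
    if not 1 <= num_stages <= n:
        raise ValueError("num_stages must be between 1 and the layer count")

    # prefix[i] = total time of the first i layers
    prefix = [0.0]
    for t in layer_times:
        prefix.append(prefix[-1] + t)

    inf = float("inf")
    # best[k][i]: smallest achievable bottleneck when the first i layers
    # are split into k stages; cut[k][i]: where the last stage starts.
    best = [[inf] * (n + 1) for _ in range(num_stages + 1)]
    cut = [[0] * (n + 1) for _ in range(num_stages + 1)]
    best[0][0] = 0.0
    for k in range(1, num_stages + 1):
        for i in range(k, n + 1):
            for j in range(k - 1, i):
                cost = max(best[k - 1][j], prefix[i] - prefix[j])
                if cost < best[k][i]:
                    best[k][i] = cost
                    cut[k][i] = j

    # Walk the cut table backwards to recover (start, end) layer ranges.
    bounds = []
    i = n
    for k in range(num_stages, 0, -1):
        j = cut[k][i]
        bounds.append((j, i))
        i = j
    bounds.reverse()
    return best[num_stages][n], bounds


def fill_bubbles(bubble_times, frozen_tasks):
    """Greedy first-fit-decreasing placement of non-trainable work into
    idle slots. Tasks that fit nowhere are returned as leftovers and
    would have to run outside the bubbles."""
    remaining = list(bubble_times)
    placement = {}
    leftover = []
    for name, duration in sorted(frozen_tasks, key=lambda t: t[1], reverse=True):
        for idx, room in enumerate(remaining):
            if duration <= room:
                remaining[idx] = room - duration
                placement[name] = idx
                break
        else:
            leftover.append(name)
    return placement, leftover


# Hypothetical per-layer times for a backbone and idle slots in its pipeline.
bottleneck, stages = partition_stages([4, 2, 3, 1, 5, 2], num_stages=3)
placement, leftover = fill_bubbles(
    [3.0, 2.0],
    [("text_encoder", 2.0), ("vae_encoder", 2.5), ("extra", 1.5)],
)
print(bottleneck, stages, placement, leftover)
```

The sketch runs the two steps independently. The abstract describes partitioning and pipeline scheduling as unified in one dynamic programming formulation, so a faithful implementation would presumably account for how a partition choice changes the bubbles available for filling.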

Taxonomy

Topics: Machine Learning and Algorithms