Composition and Alignment of Diffusion Models using Constrained Learning

Shervin Khalafi; Ignacio Hounie; Dongsheng Ding; Alejandro Ribeiro

arXiv:2508.19104·cs.LG·December 1, 2025

Composition and Alignment of Diffusion Models using Constrained Learning

Shervin Khalafi, Ignacio Hounie, Dongsheng Ding, Alejandro Ribeiro

PDF

TL;DR

This paper introduces a unified constrained optimization framework for aligning and composing diffusion models, ensuring they meet multiple reward constraints and combine desirable attributes effectively.

Contribution

It proposes a novel theoretical and algorithmic approach to unify alignment and composition of diffusion models through constrained optimization.

Findings

01

The framework guarantees models satisfy multiple reward constraints.

02

Empirical results show effective alignment and composition in image generation.

03

The approach outperforms existing methods in maintaining desired properties.

Abstract

Diffusion models have become prevalent in generative modeling due to their ability to sample from complex distributions. To improve the quality of generated samples and their compliance with user requirements, two commonly used methods are: (i) Alignment, which involves finetuning a diffusion model to align it with a reward; and (ii) Composition, which combines several pretrained diffusion models together, each emphasizing a desirable attribute in the generated outputs. However, trade-offs often arise when optimizing for multiple rewards or combining multiple models, as they can often represent competing properties. Existing methods cannot guarantee that the resulting model faithfully generates samples with all the desired properties. To address this gap, we propose a constrained optimization framework that unifies alignment and composition of diffusion models by enforcing that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.