ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning

Xiefan Guo; Miaomiao Cui; Liefeng Bo; Di Huang

arXiv:2507.22604·cs.CV·July 31, 2025

TL;DR

ShortFT is a shortcut-based fine-tuning method for diffusion models. It backpropagates the reward gradient through a shorter denoising chain built with a trajectory-preserving few-step diffusion model, which lowers computational cost and improves alignment with reward functions relative to existing backpropagation-based approaches.

Contribution

The paper proposes a novel shortcut-based fine-tuning strategy using a trajectory-preserving few-step diffusion model to improve alignment efficiency and performance.

Findings

01 Enhanced alignment with reward functions

02 Reduced computational costs during fine-tuning

03 Outperforms state-of-the-art methods

Abstract

Backpropagation-based approaches aim to align diffusion models with reward functions through end-to-end backpropagation of the reward gradient within the denoising chain, offering a promising perspective. However, due to the computational costs and the risk of gradient explosion associated with the lengthy denoising chain, existing approaches struggle to achieve complete gradient backpropagation, leading to suboptimal results. In this paper, we introduce Shortcut-based Fine-Tuning (ShortFT), an efficient fine-tuning strategy that utilizes a shorter denoising chain. More specifically, we employ the recently researched trajectory-preserving few-step diffusion model, which enables a shortcut over the original denoising chain, and construct a shortcut-based denoising chain of shorter length. The optimization on this chain notably enhances the efficiency and effectiveness of fine-tuning…
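To make the abstract's point about chain length concrete, here is a minimal toy sketch. It is not the ShortFT implementation. It assumes a hypothetical scalar "denoiser" x <- w * x + b with made-up values for w and b and a quadratic reward, and it backpropagates the reward gradient by hand through chains of different lengths. The names `denoise_chain`, `reward`, and `reward_grad_wrt_b` are illustrative only. The sketch varies only the number of steps; it does not model the trajectory-preserving few-step diffusion model that the paper uses to build its shortcut.

```python
def denoise_chain(x_start, num_steps, w=1.1, b=-0.05):
    """Toy scalar chain (assumption): apply x <- w * x + b num_steps times.

    Returns the final sample and the inputs stored at each step, mimicking
    the activations that full backpropagation has to keep in memory.
    """
    x = x_start
    stored = []
    for _ in range(num_steps):
        stored.append(x)
        x = w * x + b
    return x, stored


def reward(x0, target=0.0):
    """Toy reward (assumption): negative squared distance to a target."""
    return -(x0 - target) ** 2


def reward_grad_wrt_b(x_start, num_steps, w=1.1, b=-0.05, target=0.0):
    """Manual reverse-mode pass through the whole chain.

    Returns (d reward / d b, product of per-step Jacobians, stored count).
    """
    x0, stored = denoise_chain(x_start, num_steps, w, b)
    d_reward_d_x0 = -2.0 * (x0 - target)
    jac = 1.0      # running product of d(step output) / d(step input)
    grad_b = 0.0
    # Walk the chain backwards. Each step is linear here, so its Jacobian is
    # the constant w; a real network would need the stored activation.
    for _ in reversed(stored):
        grad_b += d_reward_d_x0 * jac  # d(w * x + b) / d b = 1 at this step
        jac *= w                       # d(w * x + b) / d x = w
    return grad_b, jac, len(stored)


if __name__ == "__main__":
    for steps in (50, 4):
        grad_b, jac, n_stored = reward_grad_wrt_b(1.0, steps)
        print(f"steps={steps} stored={n_stored} jacobian={jac:.4f} grad_b={grad_b:.4f}")
```

In this toy setting the product of per-step Jacobians is 1.1^50 (roughly 117) for the 50-step chain and 1.1^4 (roughly 1.46) for the 4-step chain, and the number of stored activations equals the number of steps. Both the memory cost and the gradient scale of full backpropagation therefore grow with chain length, which is the motivation the abstract gives for optimizing over a shorter chain.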
