HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

Desen Sun; Jason Hon; Jintao Zhang; Sihang Liu

arXiv:2603.07815·cs.CV·March 10, 2026

HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

Desen Sun, Jason Hon, Jintao Zhang, Sihang Liu

PDF

Open Access

TL;DR

HybridStitch introduces a novel diffusion model acceleration method that dynamically combines large and small models at pixel and timestep levels, significantly speeding up text-to-image generation without quality loss.

Contribution

It presents a hybrid approach that intelligently switches between models based on image complexity, improving efficiency over existing methods.

Findings

01

Achieves 1.83× speedup on Stable Diffusion 3

02

Outperforms all existing mixture model methods

03

Maintains high generation quality

Abstract

Diffusion models have demonstrated a remarkable ability in Text-to-Image (T2I) generation applications. Despite the advanced generation output, they suffer from heavy computation overhead, especially for large models that contain tens of billions of parameters. Prior work has illustrated that replacing part of the denoising steps with a smaller model still maintains the generation quality. However, these methods only focus on saving computation for some timesteps, ignoring the difference in compute demand within one timestep. In this work, we propose HybridStitch, a new T2I generation paradigm that treats generation like editing. Specifically, we introduce a hybrid stage that jointly incorporates both the large model and the small model. HybridStitch separates the entire image into two regions: one that is relatively easy to render, enabling an early transition to the smaller model, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis