MAMBO-G: Magnitude-Aware Mitigation for Boosted Guidance
Shangwen Zhu, Qianyu Peng, Zhilei Shu, Yuting Hu, Zhantao Yang, Han Zhang, Zhao Pu, Andy Zheng, Xinyu Cui, Jian Zhao, Ruili Feng, Fan Cheng

TL;DR
MAMBO-G is a training-free, guidance magnitude optimization framework that accelerates high-fidelity text-to-image and video generation by stabilizing guidance scales, achieving up to 4x speedup without sacrificing quality.
Contribution
It introduces a novel guidance modulation method that dynamically adjusts guidance scales, significantly reducing computational costs in diffusion-based generation tasks.
Findings
Up to 4x speedup on diffusion models.
Effective acceleration of large-scale video synthesis.
Maintains visual fidelity despite speed improvements.
Abstract
High-fidelity text-to-image and text-to-video generation typically relies on Classifier-Free Guidance (CFG), but achieving optimal results often demands computationally expensive sampling schedules. In this work, we propose MAMBO-G, a training-free acceleration framework that significantly reduces computational cost by dynamically optimizing guidance magnitudes. We observe that standard CFG schedules are inefficient, applying disproportionately large updates in early steps that hinder convergence speed. MAMBO-G mitigates this by modulating the guidance scale based on the update-to-prediction magnitude ratio, effectively stabilizing the trajectory and enabling rapid convergence. This efficiency is particularly vital for resource-intensive tasks like video generation. Our method serves as a universal plug-and-play accelerator, achieving up to 3x speedup on Stable Diffusion v3.5 (SD3.5)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
