Fast-ARDiff: An Entropy-informed Acceleration Framework for Continuous Space Autoregressive Generation
Zhen Zou, Xiaoxiao Ma, Jie Huang, Zichao Yu, Feng Zhao

TL;DR
Fast-ARDiff introduces an entropy-informed, unified framework that accelerates autoregressive diffusion hybrid models by optimizing both components jointly, significantly reducing latency while maintaining high-quality synthesis.
Contribution
It proposes a novel joint optimization framework with entropy-informed strategies and dynamic scheduling to accelerate AR-diffusion models for image and text generation.
Findings
Achieves 4.3× speedup on ImageNet 256×256 with TransDiff.
Attains 3× acceleration on text-conditioned generation with NextStep-1.
Demonstrates stable training and high-quality synthesis with few diffusion steps.
Abstract
Autoregressive(AR)-diffusion hybrid paradigms combine AR's structured modeling with diffusion's photorealistic synthesis, yet suffer from high latency due to sequential AR generation and iterative denoising. In this work, we tackle this bottleneck and propose a unified AR-diffusion framework Fast-ARDiff that jointly optimizes both components, accelerating AR speculative decoding while simultaneously facilitating faster diffusion decoding. Specifically: (1) The entropy-informed speculative strategy encourages draft model to produce higher-entropy representations aligned with target model's entropy characteristics, mitigating entropy mismatch and high rejection rates caused by draft overconfidence. (2) For diffusion decoding, rather than treating it as an independent module, we integrate it into the same end-to-end framework using a dynamic scheduler that prioritizes AR optimization to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Advanced Neuroimaging Techniques and Applications
