Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning

Yuxuan Gu; Weimin Bai; Yifei Wang; Weijian Luo; He Sun

arXiv:2511.15190·cs.LG·March 18, 2026

Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning

Yuxuan Gu, Weimin Bai, Yifei Wang, Weijian Luo, He Sun

PDF

Open Access

TL;DR

This paper introduces MARVAL, a distillation framework that accelerates masked auto-regressive diffusion models, making reinforcement learning practical by enabling fast inference without sacrificing sample quality.

Contribution

The paper proposes a novel score-based variational objective for distilling diffusion models into a single step and develops an efficient RL framework for masked auto-regressive models.

Findings

01

MARVAL achieves over 30x speedup on ImageNet 256x256.

02

MARVAL-Huge attains an FID of 2.00 on ImageNet 256x256.

03

MARVAL-RL improves CLIP and image-reward scores on ImageNet datasets.

Abstract

Masked auto-regressive diffusion models (MAR) benefit from the expressive modeling ability of diffusion models and the flexibility of masked auto-regressive ordering. However, vanilla MAR suffers from slow inference due to its hierarchical inference mechanism: an outer AR unmasking loop and an inner diffusion denoising chain. Such decoupled structure not only harm the generation efficiency but also hinder the practical use of MAR for reinforcement learning (RL), an increasingly critical paradigm for generative model post-training.To address this fundamental issue, we introduce MARVAL (Masked Auto-regressive Variational Acceleration), a distillation-based framework that compresses the diffusion chain into a single AR generation step while preserving the flexible auto-regressive unmasking order. Such a distillation with MARVAL not only yields substantial inference acceleration but,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Advanced Neuroimaging Techniques and Applications