Guiding Visual Autoregressive Models through Spectrum Weakening

Chaoyang Wang; Tianmeng Yang; Jingdong Wang; Yunhai Tong

arXiv:2511.22991·cs.CV·December 1, 2025

Guiding Visual Autoregressive Models through Spectrum Weakening

Chaoyang Wang, Tianmeng Yang, Jingdong Wang, Yunhai Tong

PDF

Open Access

TL;DR

This paper introduces a spectrum-weakening framework for visual autoregressive models that improves unconditional generation quality without retraining or architectural changes, by controlling spectral information in internal representations.

Contribution

It proposes a novel spectral domain manipulation method for AR models that enhances generation quality and condition alignment without model re-training or structural modifications.

Findings

01

Enables high-quality unconditional generation

02

Maintains strong prompt alignment for conditional tasks

03

Works on both discrete and continuous AR models

Abstract

Classifier-free guidance (CFG) has become a widely adopted and practical approach for enhancing generation quality and improving condition alignment. Recent studies have explored guidance mechanisms for unconditional generation, yet these approaches remain fundamentally tied to assumptions specific to diffusion models. In this work, we propose a spectrum-weakening framework for visual autoregressive (AR) models. This method works without the need for re-training, specific conditions, or any architectural modifications. It achieves this by constructing a controllable weak model in the spectral domain. We theoretically show that invertible spectral transformations preserve information, while selectively retaining only a subset of spectrum introduces controlled information reduction. Based on this insight, we perform spectrum selection along the channel dimension of internal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications