CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models

Quang-Binh Nguyen; Minh Luu; Quang Nguyen; Anh Tran; Khoi Nguyen

arXiv:2507.13984·cs.CV·March 17, 2026

CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models

Quang-Binh Nguyen, Minh Luu, Quang Nguyen, Anh Tran, Khoi Nguyen

PDF

1 Datasets

TL;DR

This paper introduces CSD-VAR, a novel content-style decomposition method for visual autoregressive models that improves disentanglement and stylization quality through innovative optimization and memory techniques.

Contribution

CSD-VAR is the first to adapt VAR models for content-style decomposition, introducing scale-aware optimization, SVD rectification, and augmented memory for better disentanglement.

Findings

01

Outperforms prior methods in content preservation.

02

Achieves higher stylization fidelity.

03

Demonstrates effectiveness on CSD-100 dataset.

Abstract

Disentangling content and style from a single image, known as content-style decomposition (CSD), enables recontextualization of extracted content and stylization of extracted styles, offering greater creative flexibility in visual synthesis. While recent personalization methods have explored the decomposition of explicit content style, they remain tailored for diffusion models. Meanwhile, Visual Autoregressive Modeling (VAR) has emerged as a promising alternative with a next-scale prediction paradigm, achieving performance comparable to that of diffusion models. In this paper, we explore VAR as a generative framework for CSD, leveraging its scale-wise generation process for improved disentanglement. To this end, we propose CSD-VAR, a novel method that introduces three key innovations: (1) a scale-aware alternating optimization strategy that aligns content and style representation with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

qualcomm/csd100
dataset· 34 dl
34 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.