Implementing Adaptations for Vision AutoRegressive Model

Kaif Shaikh; Franziska Boenisch; Adam Dziedzic

arXiv:2507.11441·cs.CV·July 29, 2025

TL;DR

This paper explores how to adapt Vision AutoRegressive models for specific tasks, compares them with diffusion models, and highlights the need for better private adaptation techniques for VAR.

Contribution

The paper implements and benchmarks various adaptation strategies for VAR, compares them with state-of-the-art diffusion model (DM) adaptation strategies, and identifies performance problems in differentially private (DP) adaptations.

Findings

01 VAR outperforms DMs in non-private (non-DP) adaptation settings.

02 DP adaptations for VAR underperform, indicating a need for further research.

03 The benchmark provides insights into adaptation strategies for VAR.

Abstract

The Vision AutoRegressive model (VAR) was recently introduced as an alternative to Diffusion Models (DMs) in the image generation domain. In this work, we focus on its adaptations, which aim to fine-tune pre-trained models to perform specific downstream tasks, such as medical data generation. While many adaptation techniques exist for DMs, adaptations for VAR remain underexplored. Similarly, differentially private (DP) adaptations, which aim to preserve the privacy of the adaptation data, have been extensively studied for DMs, while VAR lacks such solutions. In our work, we implement and benchmark many strategies for VAR and compare them to state-of-the-art DM adaptation strategies. We observe that VAR outperforms DMs for non-DP adaptations; however, the performance of DP adaptations suffers, which necessitates further research in private adaptations for VAR. Code is available at…
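The abstract does not say which mechanism the DP adaptations rely on. As a point of reference only, DP-SGD is a widely used approach to differentially private fine-tuning: each example's gradient is clipped to a fixed norm, Gaussian noise is added to the sum, and the noisy average drives the update. The sketch below illustrates that single step in plain Python. The function names (`clip_gradient`, `dp_sgd_step`) and parameters (`max_norm`, `noise_multiplier`) are hypothetical and are not taken from the paper or its repository, and the sketch omits the privacy accounting a real system needs.

```python
import math
import random


def clip_gradient(grad, max_norm):
    """Scale one per-example gradient so its L2 norm is at most max_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, max_norm / norm) if norm > 0 else 1.0
    return [g * scale for g in grad]


def dp_sgd_step(params, per_example_grads, lr, max_norm, noise_multiplier, rng):
    """One illustrative DP-SGD update (hypothetical helper, not the authors' code).

    Clips every per-example gradient, sums the clipped gradients, adds
    Gaussian noise with std = noise_multiplier * max_norm to each coordinate
    of the sum, averages over the batch, and takes a gradient-descent step.
    """
    clipped = [clip_gradient(g, max_norm) for g in per_example_grads]
    batch_size = len(clipped)
    sigma = noise_multiplier * max_norm  # noise std applied to the summed gradient
    noisy_mean = []
    for i in range(len(params)):
        total = sum(g[i] for g in clipped)
        noisy_mean.append((total + rng.gauss(0.0, sigma)) / batch_size)
    return [p - lr * g for p, g in zip(params, noisy_mean)]


if __name__ == "__main__":
    rng = random.Random(0)
    params = [0.0, 0.0]
    grads = [[3.0, 4.0], [0.3, 0.4]]  # toy per-example gradients
    print(dp_sgd_step(params, grads, lr=0.1, max_norm=1.0,
                      noise_multiplier=1.0, rng=rng))
```

Clipping bounds how much any single adaptation example can move the parameters, and the noise masks what remains of that influence. A larger `noise_multiplier` gives stronger privacy at the cost of noisier updates, which is one general reason DP fine-tuning tends to lose utility.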

Code & Models

Repositories

sprintml/finetuning_var_dp
PyTorch · Official

Taxonomy

Methods: Diffusion · Focus