CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation

Yi Liu; Shengqian Li; Zuzeng Lin; Feng Wang; Si Liu

arXiv:2506.23347·cs.CV·July 8, 2025

CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation

Yi Liu, Shengqian Li, Zuzeng Lin, Feng Wang, Si Liu

PDF

Open Access 1 Repo

TL;DR

CycleVAR introduces a novel differentiable autoregressive framework for unsupervised one-step image translation, leveraging Softmax Relaxed Quantization to enable end-to-end training and outperforming previous models in quality and speed.

Contribution

It proposes CycleVAR, a new autoregressive model with Softmax Relaxed Quantization for improved unsupervised image translation, allowing parallel one-step generation and better performance.

Findings

01

Parallel one-step generation achieves higher quality and faster inference.

02

CycleVAR surpasses state-of-the-art models like CycleGAN-Turbo.

03

Softmax Relaxed Quantization enables end-to-end training of autoregressive models.

Abstract

The current conditional autoregressive image generation methods have shown promising results, yet their potential remains largely unexplored in the practical unsupervised image translation domain, which operates without explicit cross-domain correspondences. A critical limitation stems from the discrete quantization inherent in traditional Vector Quantization-based frameworks, which disrupts gradient flow between the Variational Autoencoder decoder and causal Transformer, impeding end-to-end optimization during adversarial training in image space. To tackle this issue, we propose using Softmax Relaxed Quantization, a novel approach that reformulates codebook selection as a continuous probability mixing process via Softmax, thereby preserving gradient propagation. Building upon this differentiable foundation, we introduce CycleVAR, which reformulates image-to-image translation as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

iamcreateai/cyclevar
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Advanced Image Processing Techniques

MethodsDropout · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Layer Normalization · Dense Connections · Softmax · Transformer