AlignVTOFF: Texture-Spatial Feature Alignment for High-Fidelity Virtual Try-Off

Yihan Zhu; Mengying Ge

arXiv:2601.02038·cs.CV·January 6, 2026

AlignVTOFF: Texture-Spatial Feature Alignment for High-Fidelity Virtual Try-Off

Yihan Zhu, Mengying Ge

PDF

Open Access

TL;DR

AlignVTOFF introduces a novel framework combining multi-scale feature extraction and texture-spatial alignment to generate high-fidelity virtual garment try-ons with preserved details and realistic deformation.

Contribution

The paper proposes AlignVTOFF, a new parallel U-Net architecture with Texture-Spatial Feature Alignment for improved detail preservation in virtual try-off tasks.

Findings

01

Outperforms state-of-the-art methods in structural realism.

02

Enhances high-frequency detail fidelity.

03

Robustly models complex geometric deformation.

Abstract

Virtual Try-Off (VTOFF) is a challenging multimodal image generation task that aims to synthesize high-fidelity flat-lay garments under complex geometric deformation and rich high-frequency textures. Existing methods often rely on lightweight modules for fast feature extraction, which struggles to preserve structured patterns and fine-grained details, leading to texture attenuation during generation.To address these issues, we propose AlignVTOFF, a novel parallel U-Net framework built upon a Reference U-Net and Texture-Spatial Feature Alignment (TSFA). The Reference U-Net performs multi-scale feature extraction and enhances geometric fidelity, enabling robust modeling of deformation while retaining complex structured patterns. TSFA then injects the reference garment features into a frozen denoising U-Net via a hybrid attention design, consisting of a trainable cross-attention module and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques