FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization
Quanjian Song, Yefeng Shen, Mengting Chen, Hao Sun, Jinsong Lan, Xiaoyong Zhu, Bo Zheng, Liujuan Cao

TL;DR
FashionChameleon is a real-time, interactive framework enabling human-garment video customization with motion coherence, using single-garment data and novel training and inference techniques for applications like e-commerce.
Contribution
It introduces a novel training and inference pipeline that allows real-time, interactive garment switching in videos using only single-garment data, with enhanced coherence.
Findings
Supports real-time generation at 23.8 FPS on a single GPU.
Achieves 30-180 times faster performance than existing methods.
Enables interactive multi-garment video customization with preserved motion coherence.
Abstract
Human-centric video customization, particularly at the garment level, has shown significant commercial value. However, existing approaches cannot support low-latency and interactive garment control, which is crucial for applications such as e-commerce and content creation. This paper studies how to achieve interactive multi-garment video customization while preserving motion coherence using only single-garment video data. We present FashionChameleon, a real-time and interactive framework for human-garment customization in autoregressive video generation, where users can interactively switch garment during generation. FashionChameleon consists of three key techniques: (i) Instead of training on multi-garment video data, we train a Teacher Model with In-Context Learning on a single reference-garment pair. By retaining the image-to-video training paradigm while enforcing a mismatch between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
