Loading paper
VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers | Tomesphere