LCUDiff: Latent Capacity Upgrade Diffusion for Faithful Human Body Restoration

Jue Gong; Zihan Zhou; Jingkai Wang; Shu Li; Libo Liu; Jianliang Lan; Yulun Zhang

arXiv:2602.04406·cs.CV·February 5, 2026

LCUDiff: Latent Capacity Upgrade Diffusion for Faithful Human Body Restoration

Jue Gong, Zihan Zhou, Jingkai Wang, Shu Li, Libo Liu, Jianliang Lan, Yulun Zhang

PDF

Open Access

TL;DR

LCUDiff introduces a novel one-step latent diffusion framework that upgrades pre-trained models from 4-channel to 16-channel latent spaces, significantly improving human body restoration fidelity while maintaining efficiency.

Contribution

The paper presents a new method for latent space upgrade in diffusion models, including channel splitting distillation, prior-preserving adaptation, and a decoder router for enhanced human body image restoration.

Findings

01

Achieves higher fidelity in human body restoration with fewer artifacts.

02

Maintains one-step efficiency in the diffusion process.

03

Demonstrates superior performance on synthetic and real-world datasets.

Abstract

Existing methods for restoring degraded human-centric images often struggle with insufficient fidelity, particularly in human body restoration (HBR). Recent diffusion-based restoration methods commonly adapt pre-trained text-to-image diffusion models, where the variational autoencoder (VAE) can significantly bottleneck restoration fidelity. We propose LCUDiff, a stable one-step framework that upgrades a pre-trained latent diffusion model from the 4-channel latent space to the 16-channel latent space. For VAE fine-tuning, channel splitting distillation (CSD) is used to keep the first four channels aligned with pre-trained priors while allocating the additional channels to effectively encode high-frequency details. We further design prior-preserving adaptation (PPA) to smoothly bridge the mismatch between 4-channel diffusion backbones and the higher-dimensional 16-channel latent. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image Enhancement Techniques · Advanced Image Processing Techniques