TL;DR
OSDFace introduces a one-step diffusion model for face restoration that improves efficiency and maintains high identity consistency, surpassing current methods in quality and metrics.
Contribution
The paper proposes a novel one-step diffusion approach with a visual representation embedder and identity loss, enhancing face restoration speed and accuracy.
Findings
OSDFace outperforms SOTA methods in visual quality.
It achieves higher identity consistency in restored faces.
Experimental results confirm improved quantitative metrics.
Abstract
Diffusion models have demonstrated impressive performance in face restoration. Yet, their multi-step inference process remains computationally intensive, limiting their applicability in real-world scenarios. Moreover, existing methods often struggle to generate face images that are harmonious, realistic, and consistent with the subject's identity. In this work, we propose OSDFace, a novel one-step diffusion model for face restoration. Specifically, we propose a visual representation embedder (VRE) to better capture prior information and understand the input face. In VRE, low-quality faces are processed by a visual tokenizer and subsequently embedded with a vector-quantized dictionary to generate visual prompts. Additionally, we incorporate a facial identity loss derived from face recognition to further ensure identity consistency. We further employ a generative adversarial network (GAN)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
