Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement

Chia-Hern Lai; I-Hsuan Lo; Yen-Ku Yeh; Thanh-Nguyen Truong; Ching-Chun Huang

arXiv:2512.08215·cs.CV·December 10, 2025

Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement

Chia-Hern Lai, I-Hsuan Lo, Yen-Ku Yeh, Thanh-Nguyen Truong, Ching-Chun Huang

PDF

Open Access

TL;DR

Blur2Sharp introduces a novel framework combining 3D-aware neural rendering and diffusion models to generate sharp, consistent, and realistic human images from a single reference view, advancing pose and view synthesis.

Contribution

It presents a dual-conditioning approach with Human NeRF and diffusion models, integrating hierarchical feature fusion and priors for improved visual quality and consistency.

Findings

01

Outperforms state-of-the-art in pose and view synthesis

02

Generates sharper, more realistic human images

03

Handles challenging scenarios with clothing and occlusions

Abstract

The creation of lifelike human avatars capable of realistic pose variation and viewpoint flexibility remains a fundamental challenge in computer vision and graphics. Current approaches typically yield either geometrically inconsistent multi-view images or sacrifice photorealism, resulting in blurry outputs under diverse viewing angles and complex motions. To address these issues, we propose Blur2Sharp, a novel framework integrating 3D-aware neural rendering and diffusion models to generate sharp, geometrically consistent novel-view images from only a single reference view. Our method employs a dual-conditioning architecture: initially, a Human NeRF model generates geometrically coherent multi-view renderings for target poses, explicitly encoding 3D structural guidance. Subsequently, a diffusion model conditioned on these renderings refines the generated images, preserving fine-grained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Advanced Vision and Imaging