3D-aware Blending with Generative NeRFs
Hyunsu Kim, Gayoung Lee, Yunjey Choi, Jin-Hwa Kim, Jun-Yan Zhu

TL;DR
This paper introduces a 3D-aware blending technique using generative NeRFs that aligns and blends images in 3D space, improving over traditional 2D methods for misaligned inputs.
Contribution
The paper presents a novel 3D-aware blending approach with camera pose estimation, local 3D alignment, and latent space blending within generative NeRFs, addressing limitations of 2D methods.
Findings
Outperforms existing 2D blending baselines on FFHQ and AFHQ-Cat datasets.
Demonstrates superior qualitative and quantitative results.
Effectively handles misaligned images through 3D-aware techniques.
Abstract
Image blending aims to combine multiple images seamlessly. It remains challenging for existing 2D-based methods, especially when input images are misaligned due to differences in 3D camera poses and object shapes. To tackle these issues, we propose a 3D-aware blending method using generative Neural Radiance Fields (NeRF), including two key components: 3D-aware alignment and 3D-aware blending. For 3D-aware alignment, we first estimate the camera pose of the reference image with respect to generative NeRFs and then perform 3D local alignment for each part. To further leverage 3D information of the generative NeRF, we propose 3D-aware blending that directly blends images on the NeRF's latent representation space, rather than raw pixel space. Collectively, our method outperforms existing 2D baselines, as validated by extensive quantitative and qualitative evaluations with FFHQ and AFHQ-Cat.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
3D-aware Blending with Generative NeRFs· youtube
Taxonomy
TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · 3D Surveying and Cultural Heritage
