GenLayNeRF: Generalizable Layered Representations with 3D Model   Alignment for Multi-Human View Synthesis

Youssef Abdelkareem; Shady Shehata; Fakhri Karray

arXiv:2309.11627·cs.CV·September 22, 2023

GenLayNeRF: Generalizable Layered Representations with 3D Model Alignment for Multi-Human View Synthesis

Youssef Abdelkareem, Shady Shehata, Fakhri Karray

PDF

Open Access

TL;DR

GenLayNeRF introduces a novel, generalizable layered scene representation for multi-human view synthesis that aligns 3D models with input views without per-scene optimization, enabling high-quality free-viewpoint rendering from sparse views.

Contribution

It proposes a fully trainable, multi-human layered scene model with pixel-level 3D model alignment, eliminating the need for per-scene optimization and handling sparse views effectively.

Findings

01

Outperforms existing generalizable and non-human NeRF methods.

02

Achieves comparable results to layered per-scene methods without test-time optimization.

03

Effectively handles multi-human scenes with sparse input views.

Abstract

Novel view synthesis (NVS) of multi-human scenes imposes challenges due to the complex inter-human occlusions. Layered representations handle the complexities by dividing the scene into multi-layered radiance fields, however, they are mainly constrained to per-scene optimization making them inefficient. Generalizable human view synthesis methods combine the pre-fitted 3D human meshes with image features to reach generalization, yet they are mainly designed to operate on single-human scenes. Another drawback is the reliance on multi-step optimization techniques for parametric pre-fitting of the 3D body models that suffer from misalignment with the images in sparse view settings causing hallucinations in synthesized views. In this work, we propose, GenLayNeRF, a generalizable layered scene representation for free-viewpoint rendering of multiple human subjects which requires no per-scene…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Human Pose and Action Recognition