Multi-task View Synthesis with Neural Radiance Fields

Shuhong Zheng; Zhipeng Bao; Martial Hebert; Yu-Xiong Wang

arXiv:2309.17450·cs.CV·October 2, 2023

Multi-task View Synthesis with Neural Radiance Fields

Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

This paper introduces MuvieNeRF, a novel framework for multi-task view synthesis that leverages multi-view and multi-task attention mechanisms to synthesize various scene properties, outperforming traditional models.

Contribution

The paper proposes MuvieNeRF, a new multi-task view synthesis framework with cross-task and cross-view attention modules, enabling versatile scene property synthesis across different NeRF backbones.

Findings

01

MuvieNeRF achieves high-quality multi-property synthesis on synthetic and real benchmarks.

02

It outperforms conventional discriminative models in multi-view scene property prediction.

03

The framework demonstrates universal applicability across various NeRF architectures.

Abstract

Multi-task visual learning is a critical aspect of computer vision. Current research, however, predominantly concentrates on the multi-task dense prediction setting, which overlooks the intrinsic 3D world and its multi-view consistent structures, and lacks the capability for versatile imagination. In response to these limitations, we present a novel problem setting -- multi-task view synthesis (MTVS), which reinterprets multi-task prediction as a set of novel-view synthesis tasks for multiple scene properties, including RGB. To tackle the MTVS problem, we propose MuvieNeRF, a framework that incorporates both multi-task and cross-view knowledge to simultaneously synthesize multiple scene properties. MuvieNeRF integrates two key modules, the Cross-Task Attention (CTA) and Cross-View Attention (CVA) modules, enabling the efficient use of information across multiple views and tasks.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zsh2000/muvienerf
pytorchOfficial

Datasets

ShuhongZheng/MTVS
dataset· 20 dl
20 dl

Videos

Multi-task View Synthesis with Neural Radiance Fields· youtube

Taxonomy

TopicsAdvanced Vision and Imaging · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications