MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields
Takuhiro Kaneko

TL;DR
MIMO-NeRF introduces a multi-input multi-output neural radiance field model that accelerates rendering by reducing the number of neural networks needed, using a self-supervised approach to maintain quality.
Contribution
It replaces traditional SISO MLPs with MIMO MLPs for faster rendering and proposes a self-supervised regularization method to address ambiguity issues.
Findings
Achieves a good speed-quality trade-off in rendering.
Compatible with existing fast NeRF methods like DONeRF and TensoRF.
Demonstrates efficiency with reasonable training time.
Abstract
Neural radiance fields (NeRFs) have shown impressive results for novel view synthesis. However, they depend on the repetitive use of a single-input single-output multilayer perceptron (SISO MLP) that maps 3D coordinates and view direction to the color and volume density in a sample-wise manner, which slows the rendering. We propose a multi-input multi-output NeRF (MIMO-NeRF) that reduces the number of MLPs running by replacing the SISO MLP with a MIMO MLP and conducting mappings in a group-wise manner. One notable challenge with this approach is that the color and volume density of each point can differ according to a choice of input coordinates in a group, which can lead to some notable ambiguity. We also propose a self-supervised learning method that regularizes the MIMO MLP with multiple fast reformulated MLPs to alleviate this ambiguity without using pretrained models. The results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields· youtube
Taxonomy
TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis · Advanced Vision and Imaging
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
