NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for   Real-Time Novel View Synthesis

Yubin Hu; Xiaoyang Guo; Yang Xiao; Jingwei Huang; and Yong-Jin Liu

arXiv:2407.10482·cs.CV·July 16, 2024

NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis

Yubin Hu, Xiaoyang Guo, Yang Xiao, Jingwei Huang, and Yong-Jin Liu

PDF

Open Access

TL;DR

NGP-RT significantly accelerates NeRF-based view synthesis by combining hash features with lightweight attention and occupancy grids, enabling real-time rendering at high quality.

Contribution

It introduces a novel method that explicitly stores hash features and uses lightweight attention to improve rendering speed and quality in real-time NeRF applications.

Findings

01

Achieves 108 fps at 1080p resolution on a single GPU.

02

Outperforms previous NeRF methods in rendering quality on Mip-NeRF360.

03

Reduces computational complexity by replacing MLP with attention and occupancy grids.

Abstract

This paper presents NGP-RT, a novel approach for enhancing the rendering speed of Instant-NGP to achieve real-time novel view synthesis. As a classic NeRF-based method, Instant-NGP stores implicit features in multi-level grids or hash tables and applies a shallow MLP to convert the implicit features into explicit colors and densities. Although it achieves fast training speed, there is still a lot of room for improvement in its rendering speed due to the per-point MLP executions for implicit multi-level feature aggregation, especially for real-time applications. To address this challenge, our proposed NGP-RT explicitly stores colors and densities as hash features, and leverages a lightweight attention mechanism to disambiguate the hash collisions instead of using computationally intensive MLP. At the rendering stage, NGP-RT incorporates a pre-computed occupancy distance grid into the ray…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Video Analysis and Summarization · Image Retrieval and Classification Techniques

MethodsSoftmax · Attention Is All You Need · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings