A Refined 3D Gaussian Representation for High-Quality Dynamic Scene   Reconstruction

Bin Zhang; Bi Zeng; Zexin Peng

arXiv:2405.17891·cs.CV·May 29, 2024

A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction

Bin Zhang, Bi Zeng, Zexin Peng

PDF

Open Access

TL;DR

This paper introduces a refined 3D Gaussian representation that enhances high-quality dynamic scene reconstruction by reducing memory usage and improving rendering speed and quality, using a deformable MLP, hash encoding, and denoising techniques.

Contribution

It proposes a novel dynamic 3D Gaussian model with noise reduction and motion constraints, advancing scene reconstruction efficiency and quality over previous methods.

Findings

01

Outperforms existing methods in rendering quality and speed

02

Reduces memory usage significantly

03

Effective noise removal and motion consistency enforcement

Abstract

In recent years, Neural Radiance Fields (NeRF) has revolutionized three-dimensional (3D) reconstruction with its implicit representation. Building upon NeRF, 3D Gaussian Splatting (3D-GS) has departed from the implicit representation of neural networks and instead directly represents scenes as point clouds with Gaussian-shaped distributions. While this shift has notably elevated the rendering quality and speed of radiance fields but inevitably led to a significant increase in memory usage. Additionally, effectively rendering dynamic scenes in 3D-GS has emerged as a pressing challenge. To address these concerns, this paper purposes a refined 3D Gaussian representation for high-quality dynamic scene reconstruction. Firstly, we use a deformable multi-layer perceptron (MLP) network to capture the dynamic offset of Gaussian points and express the color features of points through hash…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Robotics and Sensor-Based Localization

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings