Grid4D: 4D Decomposed Hash Encoding for High-Fidelity Dynamic Gaussian Splatting
Jiawei Xu, Zexin Fan, Jian Yang, Jin Xie

TL;DR
Grid4D introduces a novel 4D hash encoding and directional attention for dynamic scene rendering, significantly improving visual quality and speed over existing methods by avoiding low-rank assumptions and enhancing deformation fitting.
Contribution
The paper proposes Grid4D, a new explicit 4D encoding method using hash encoding and directional attention, addressing limitations of plane-based methods in dynamic scene rendering.
Findings
Outperforms state-of-the-art models in visual quality
Achieves faster rendering speeds
Effectively models diverse scene deformations
Abstract
Recently, Gaussian splatting has received more and more attention in the field of static scene rendering. Due to the low computational overhead and inherent flexibility of explicit representations, plane-based explicit methods are popular ways to predict deformations for Gaussian-based dynamic scene rendering models. However, plane-based methods rely on the inappropriate low-rank assumption and excessively decompose the space-time 4D encoding, resulting in overmuch feature overlap and unsatisfactory rendering quality. To tackle these problems, we propose Grid4D, a dynamic scene rendering model based on Gaussian splatting and employing a novel explicit encoding method for the 4D input through the hash encoding. Different from plane-based explicit representations, we decompose the 4D encoding into one spatial and three temporal 3D hash encodings without the low-rank assumption.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Data Compression Techniques · Advanced Image and Video Retrieval Techniques
MethodsSoftmax · Attention Is All You Need
