TL;DR
VeGaS introduces a novel Gaussian-based video representation that enables realistic editing and outperforms existing methods in frame reconstruction, addressing limitations of previous neural implicit models.
Contribution
The paper presents VeGaS, a new model using Folded-Gaussian distributions for nonlinear video dynamics, allowing effective editing and improved reconstruction over prior Gaussian splatting approaches.
Findings
Outperforms state-of-the-art in frame reconstruction
Enables realistic video modifications
Models nonlinear dynamics with Folded-Gaussian distributions
Abstract
Implicit Neural Representations (INRs) employ neural networks to approximate discrete data as continuous functions. In the context of video data, such models can be utilized to transform the coordinates of pixel locations along with frame occurrence times (or indices) into RGB color values. Although INRs facilitate effective compression, they are unsuitable for editing purposes. One potential solution is to use a 3D Gaussian Splatting (3DGS) based model, such as the Video Gaussian Representation (VGR), which is capable of encoding video as a multitude of 3D Gaussians and is applicable for numerous video processing operations, including editing. Nevertheless, in this case, the capacity for modification is constrained to a limited set of basic transformations. To address this issue, we introduce the Video Gaussian Splatting (VeGaS) model, which enables realistic modifications of video…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
