PackUV: Packed Gaussian UV Maps for 4D Volumetric Video
Aashish Rai, Angela Xing, Anushka Agarwal, Xiaoyan Cong, Zekun Li, Tao Lu, Aayush Prakash, Srinath Sridhar

TL;DR
PackUV introduces a novel 4D Gaussian representation mapped into multi-scale UV atlases, enabling compact, video-codec-compatible storage and scalable, high-quality volumetric video reconstruction for long sequences.
Contribution
The paper presents PackUV, a new 4D Gaussian representation with a UV atlas format and a fitting method that ensures temporal consistency and compatibility with standard video codecs.
Findings
Outperforms existing methods in rendering fidelity.
Supports sequences up to 30 minutes with consistent quality.
Introduces the largest multi-view volumetric video dataset to date.
Abstract
Volumetric videos offer immersive 4D experiences, but remain difficult to reconstruct, store, and stream at scale. Existing Gaussian Splatting based methods achieve high-quality reconstruction but break down on long sequences, temporal inconsistency, and fail under large motions and disocclusions. Moreover, their outputs are typically incompatible with conventional video coding pipelines, preventing practical applications. We introduce PackUV, a novel 4D Gaussian representation that maps all Gaussian attributes into a sequence of structured, multi-scale UV atlas, enabling compact, image-native storage. To fit this representation from multi-view videos, we propose PackUV-GS, a temporally consistent fitting method that directly optimizes Gaussian parameters in the UV domain. A flow-guided Gaussian labeling and video keyframing module identifies dynamic Gaussians, stabilizes static…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Advanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis
