CPSL: Representing Volumetric Video via Content-Promoted Scene Layers

Kaiyuan Hu; Yili Jin; Junhua Liu; Xize Duan; Hong Kang; and Xue Liu

arXiv:2511.14927·cs.CV·November 20, 2025

CPSL: Representing Volumetric Video via Content-Promoted Scene Layers

Kaiyuan Hu, Yili Jin, Junhua Liu, Xize Duan, Hong Kang, and Xue Liu

PDF

Open Access

TL;DR

CPSL introduces a compact 2.5D video representation that enables scalable, real-time volumetric video synthesis by decomposing frames into content-guided layers, significantly reducing costs while maintaining high perceptual quality.

Contribution

The paper proposes CPSL, a novel 2.5D scene layer method guided by depth and saliency, facilitating efficient, high-quality volumetric video rendering without expensive 3D reconstruction.

Findings

01

Outperforms layer-based and neural-field methods in perceptual quality.

02

Reduces storage and rendering costs by several folds.

03

Supports real-time playback with standard video codecs.

Abstract

Volumetric video enables immersive and interactive visual experiences by supporting free viewpoint exploration and realistic motion parallax. However, existing volumetric representations from explicit point clouds to implicit neural fields, remain costly in capture, computation, and rendering, which limits their scalability for on-demand video and reduces their feasibility for real-time communication. To bridge this gap, we propose Content-Promoted Scene Layers (CPSL), a compact 2.5D video representation that brings the perceptual benefits of volumetric video to conventional 2D content. Guided by per-frame depth and content saliency, CPSL decomposes each frame into a small set of geometry-consistent layers equipped with soft alpha bands and an edge-depth cache that jointly preserve occlusion ordering and boundary continuity. These lightweight, 2D-encodable assets enable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Video Coding and Compression Technologies