PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization
Yezhi Shen, Qiuchen Zhai, Fengqing Zhu

TL;DR
This paper introduces PS4PRO, a novel video frame interpolation model that enhances neural rendering by augmenting data, leading to improved 3D scene reconstruction in static and dynamic environments.
Contribution
The paper presents PS4PRO, a lightweight high-quality video interpolation model that implicitly models camera movement and geometry to improve neural rendering and 3D reconstruction.
Findings
Improved reconstruction quality in static scenes.
Enhanced performance in dynamic scenes.
Effective data augmentation for neural rendering.
Abstract
Neural rendering methods have gained significant attention for their ability to reconstruct 3D scenes from 2D images. The core idea is to take multiple views as input and optimize the reconstructed scene by minimizing the uncertainty in geometry and appearance across the views. However, the reconstruction quality is limited by the number of input views. This limitation is further pronounced in complex and dynamic scenes, where certain angles of objects are never seen. In this paper, we propose to use video frame interpolation as the data augmentation method for neural rendering. Furthermore, we design a lightweight yet high-quality video frame interpolation model, PS4PRO (Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization). PS4PRO is trained on diverse video datasets, implicitly modeling camera movement as well as real-world 3D geometry. Our model performs as an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques
MethodsSoftmax · Attention Is All You Need
