Sequence Matters: Harnessing Video Models in 3D Super-Resolution

Hyun-kyu Ko; Dongheok Park; Youngin Park; Byeonghyeon Lee; Juhee Han,; Eunbyung Park

arXiv:2412.11525·cs.CV·December 24, 2024

Sequence Matters: Harnessing Video Models in 3D Super-Resolution

Hyun-kyu Ko, Dongheok Park, Youngin Park, Byeonghyeon Lee, Juhee Han,, Eunbyung Park

PDF

Open Access

TL;DR

This paper demonstrates that video super-resolution models can significantly improve 3D model reconstruction from low-resolution multi-view images by enhancing spatial consistency and detail, even without precise image alignment.

Contribution

It introduces a practical approach leveraging VSR models for 3D super-resolution, achieving state-of-the-art results without complex fine-tuning or trajectory smoothing.

Findings

01

VSR models improve spatial consistency in 3D reconstructions.

02

Simple alignment algorithms can match state-of-the-art performance.

03

Effective on benchmark datasets like NeRF-synthetic and MipNeRF-360.

Abstract

3D super-resolution aims to reconstruct high-fidelity 3D models from low-resolution (LR) multi-view images. Early studies primarily focused on single-image super-resolution (SISR) models to upsample LR images into high-resolution images. However, these methods often lack view consistency because they operate independently on each image. Although various post-processing techniques have been extensively explored to mitigate these inconsistencies, they have yet to fully resolve the issues. In this paper, we perform a comprehensive study of 3D super-resolution by leveraging video super-resolution (VSR) models. By utilizing VSR models, we ensure a higher degree of spatial consistency and can reference surrounding spatial information, leading to more accurate and detailed reconstructions. Our findings reveal that VSR models can perform remarkably well even on sequences that lack precise…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques

MethodsALIGN