RayFormer: Modeling Inter- and Intra-Ray Similarity for NeRF-Based Video Snapshot Compressive Imaging

Yubo Dong; Danhua Liu; Anqi Li; Zhenyuan Lin

arXiv:2604.27702·cs.CV·May 1, 2026

RayFormer: Modeling Inter- and Intra-Ray Similarity for NeRF-Based Video Snapshot Compressive Imaging

Yubo Dong, Danhua Liu, Anqi Li, Zhenyuan Lin

PDF

TL;DR

RayFormer introduces a novel patch-level ray sampling and transformer-based approach to improve NeRF-based video snapshot compressive imaging, capturing structural similarities for superior reconstruction quality.

Contribution

The paper proposes a patch-level sampling strategy and a transformer model to better capture structural similarities in NeRF-based SCI, enhancing reconstruction performance.

Findings

01

Achieves state-of-the-art reconstruction results in simulated scenes.

02

Effectively models inter- and intra-ray structural similarities.

03

Incorporates total variation prior for spatial smoothness.

Abstract

Video snapshot compressive imaging (SCI) enables the reconstruction of dynamic scenes from a single snapshot measurement. Recently, NeRF-based methods have shown promising reconstruction performance. However, such methods typically adopt random ray sampling strategies and fail to capture content structural similarities, resulting in limited reconstruction quality. To address these issues, we first propose a patch-level ray sampling strategy to enable the modeling of content structure. Then, we propose an Inter- and Intra-Ray Transformer (RayFormer) to capture the structural similarities, modeling both inter-ray similarities among spatially neighboring points at the same depth and intra-ray correlations between adjacent points along the viewing ray. Finally, benefiting from the patch-level sampling strategy, the total variation prior is incorporated into the objective function to enhance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.