TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction
DaDong Jiang, Zhihui Ke, Xiaobo Zhou, Zhi Hou, Xianghui Yang, Wenbo Hu, Tie Qiu, Chunchao Guo

TL;DR
TimeFormer introduces a transformer-based module that implicitly models motion patterns in deformable 3D Gaussian reconstruction, significantly improving dynamic scene reconstruction quality without sacrificing inference speed.
Contribution
It presents a novel plug-and-play TimeFormer module with a Cross-Temporal Transformer Encoder and a two-stream optimization strategy for enhanced dynamic scene reconstruction.
Findings
Improves reconstruction quality in complex dynamic scenes.
Maintains original rendering speed during inference.
Validates effectiveness through extensive experiments.
Abstract
Dynamic scene reconstruction is a long-term challenge in 3D vision. Recent methods extend 3D Gaussian Splatting to dynamic scenes via additional deformation fields and apply explicit constraints like motion flow to guide the deformation. However, they learn motion changes from individual timestamps independently, making it challenging to reconstruct complex scenes, particularly when dealing with violent movement, extreme-shaped geometries, or reflective surfaces. To address the above issue, we design a plug-and-play module called TimeFormer to enable existing deformable 3D Gaussians reconstruction methods with the ability to implicitly model motion patterns from a learning perspective. Specifically, TimeFormer includes a Cross-Temporal Transformer Encoder, which adaptively learns the temporal relationships of deformable 3D Gaussians. Furthermore, we propose a two-stream optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Advanced Vision and Imaging · Cell Image Analysis Techniques
MethodsAttention Is All You Need · Dense Connections · Label Smoothing · Adam · Residual Connection · Byte Pair Encoding · Balanced Selection · Linear Layer · Softmax · Position-Wise Feed-Forward Layer
