TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction

DaDong Jiang; Zhihui Ke; Xiaobo Zhou; Zhi Hou; Xianghui Yang; Wenbo Hu; Tie Qiu; Chunchao Guo

arXiv:2411.11941 · cs.CV · October 7, 2025

TL;DR

TimeFormer introduces a transformer-based module that implicitly models motion patterns in deformable 3D Gaussian reconstruction, significantly improving dynamic scene reconstruction quality without sacrificing inference speed.

Contribution

It presents a novel plug-and-play TimeFormer module with a Cross-Temporal Transformer Encoder and a two-stream optimization strategy for enhanced dynamic scene reconstruction.

Findings

01. Improves reconstruction quality in complex dynamic scenes.

02. Maintains original rendering speed during inference.

03. Validates effectiveness through extensive experiments.

Abstract

Dynamic scene reconstruction is a long-standing challenge in 3D vision. Recent methods extend 3D Gaussian Splatting to dynamic scenes via additional deformation fields and apply explicit constraints, such as motion flow, to guide the deformation. However, they learn motion changes from individual timestamps independently, which makes it hard to reconstruct complex scenes, particularly those with violent movement, extreme-shaped geometries, or reflective surfaces. To address this issue, we design a plug-and-play module called TimeFormer that equips existing deformable 3D Gaussian reconstruction methods with the ability to implicitly model motion patterns from a learning perspective. Specifically, TimeFormer includes a Cross-Temporal Transformer Encoder, which adaptively learns the temporal relationships of deformable 3D Gaussians. Furthermore, we propose a two-stream optimization…
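
To make the idea of learning relationships across timestamps more concrete, here is a minimal sketch of single-head self-attention applied along the time axis for one Gaussian. It is an illustration only, not the authors' implementation: the abstract does not specify the token layout, so the function names (`time_embedding`, `cross_temporal_attention`), the sinusoidal time encoding, and the use of identity projections in place of learned query, key, and value weights are all assumptions.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def time_embedding(t, dim=4):
    """Hypothetical sinusoidal encoding of a scalar timestamp t (dim must be even)."""
    return [
        fn(t * (2 ** k) * math.pi)
        for k in range(dim // 2)
        for fn in (math.sin, math.cos)
    ]


def cross_temporal_attention(position, timestamps):
    """Self-attention over the time axis for a single Gaussian.

    Each token is the Gaussian's position concatenated with a time embedding.
    Assumption: queries, keys, and values are the raw tokens (no learned
    projections), so this shows only the attention pattern, not a trained model.
    """
    tokens = [list(position) + time_embedding(t) for t in timestamps]
    scale = math.sqrt(len(tokens[0]))
    outputs = []
    for query in tokens:
        # Compare this timestamp's token against the tokens of all timestamps.
        scores = [
            sum(q * k for q, k in zip(query, key)) / scale for key in tokens
        ]
        weights = softmax(scores)
        # Mix information from every timestamp into this timestamp's feature.
        mixed = [
            sum(w * value[d] for w, value in zip(weights, tokens))
            for d in range(len(query))
        ]
        outputs.append(mixed)
    return outputs
```

Calling `cross_temporal_attention((0.1, 0.2, 0.3), [0.0, 0.5, 1.0])` returns one mixed feature per timestamp. In a real deformable 3D Gaussian pipeline such features would presumably feed a deformation field, but that wiring is outside what the abstract states.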

Code & Models

Repositories

PatrickDDj/TimeFormer-Code
pytorch
