SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting

Shengjie Lin; Jiading Fang; Muhammad Zubair Irshad; Vitor Campagnolo Guizilini; Rares Andrei Ambrus; Greg Shakhnarovich; Matthew R. Walter

arXiv:2506.03594 · cs.GR · June 5, 2025

TL;DR

SplArt is a self-supervised framework that uses 3D Gaussian Splatting to reconstruct articulated objects and infer their kinematics from two sets of posed RGB images captured at different articulation states. It renders in real time and needs no 3D annotations.

Contribution

SplArt introduces a category-agnostic, self-supervised method that uses 3D Gaussian Splatting with a multi-stage optimization to reconstruct articulated objects and infer their kinematics.

Findings

01. Achieves state-of-the-art accuracy in articulated object reconstruction.

02. Renders photorealistic views in real time.

03. Requires neither 3D supervision nor category-specific priors.

Abstract

Reconstructing articulated objects prevalent in daily environments is crucial for applications in augmented/virtual reality and robotics. However, existing methods face scalability limitations (requiring 3D supervision or costly annotations), robustness issues (being susceptible to local optima), and rendering shortcomings (lacking speed or photorealism). We introduce SplArt, a self-supervised, category-agnostic framework that leverages 3D Gaussian Splatting (3DGS) to reconstruct articulated objects and infer kinematics from two sets of posed RGB images captured at different articulation states, enabling real-time photorealistic rendering for novel viewpoints and articulations. SplArt augments 3DGS with a differentiable mobility parameter per Gaussian, achieving refined part segmentation. A multi-stage optimization strategy is employed to progressively handle reconstruction, part…
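
One way to picture the per-Gaussian mobility parameter mentioned in the abstract is as a soft weight between "static" and "moving". The sketch below is an illustration under stated assumptions, not SplArt's implementation: the function names, the sigmoid parameterization, the z-axis revolute joint, and the linear blending of positions are all hypothetical choices made for this example.

```python
import math

def sigmoid(x):
    """Map an unconstrained logit to a soft mobility value in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def rotate_z(point, angle):
    """Rotate a 3D point about the z-axis through the origin (assumed joint)."""
    x, y, z = point
    c, s = math.cos(angle), math.sin(angle)
    return (c * x - s * y, s * x + c * y, z)

def articulate(centers, mobility_logits, angle):
    """Blend each Gaussian center between its static and rotated position.

    Assumption: mobility m weights the articulated position, so m near 0
    leaves the center in place and m near 1 moves it with the joint.
    """
    moved = []
    for p, logit in zip(centers, mobility_logits):
        m = sigmoid(logit)
        q = rotate_z(p, angle)
        moved.append(tuple((1.0 - m) * a + m * b for a, b in zip(p, q)))
    return moved

# Two Gaussians at the same place: one nearly static, one nearly fully mobile.
centers = [(1.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
logits = [-20.0, 20.0]
result = articulate(centers, logits, math.pi / 2)
```

Because the blend is smooth in the logits, a gradient-based optimizer could in principle adjust each Gaussian's mobility from image losses alone, which is the sense in which such a parameter is differentiable.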

Code & Models

Repositories

ripl/splart
PyTorch · Official

Taxonomy

Topics: Robot Manipulation and Learning · 3D Shape Modeling and Analysis · Advanced Vision and Imaging

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings