JRM: Joint Reconstruction Model for Multiple Objects without Alignment

Qirui Wu; Yawar Siddiqui; Duncan Frost; Samir Aroudj; Armen Avetisyan; Richard Newcombe; Angel X. Chang; Jakob Engel; Henry Howard-Jenkins

arXiv:2603.25985·cs.CV·March 30, 2026

JRM: Joint Reconstruction Model for Multiple Objects without Alignment

Qirui Wu, Yawar Siddiqui, Duncan Frost, Samir Aroudj, Armen Avetisyan, Richard Newcombe, Angel X. Chang, Jakob Engel, Henry Howard-Jenkins

PDF

TL;DR

JRM is a 3D generative model that implicitly aggregates multiple unaligned observations to improve object reconstruction, especially in non-rigid scenarios, without explicit alignment.

Contribution

It introduces a novel flow-matching generative approach that handles unaligned data and non-rigid changes, surpassing prior explicit alignment methods.

Findings

01

JRM outperforms baseline methods in reconstruction quality.

02

Implicit aggregation enhances robustness to alignment errors.

03

Handles non-rigid transformations like articulation effectively.

Abstract

Object-centric reconstruction seeks to recover the 3D structure of a scene through composition of independent objects. While this independence can simplify modeling, it discards strong signals that could improve reconstruction, notably repetition where the same object model is seen multiple times in a scene, or across scans. We propose the Joint Reconstruction Model (JRM) to leverage repetition by framing object reconstruction as one of personalized generation: multiple observations share a common subject that should be consistent for all observations, while still adhering to the specific pose and state from each. Prior methods in this direction rely on explicit matching and rigid alignment across observations, making them sensitive to errors and difficult to extend to non-rigid transformations. In contrast, JRM is a 3D flow-matching generative model that implicitly aggregates unaligned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.