JODA: Composable Joint Dynamics for Articulated Objects

Tianhong Gao; Cheng Yu; Yinghao Xu; Mengyu Chu

arXiv:2605.09954 · cs.RO · May 12, 2026

TL;DR

JODA is a structured, interpretable framework for modeling fine-grained joint dynamics of articulated objects, such as frictional holding, detents, soft closing, and snap latching. The dynamics are inferred and refined from multimodal data, enabling realistic simulation and control.

Contribution

JODA represents joint-level dynamics as a three-channel field over the joint degree of freedom (conservative forces, dry friction, and damping), uses vision-language models to infer these dynamics, and supports controllable, differentiable simulation.

Findings

01. JODA accurately models diverse joint behaviors.

02. The framework supports manipulation and gradient-based refinement.

03. Code and assets will be publicly released.

Abstract

Articulated objects used in simulation and embodied AI are typically specified by geometry and kinematic structure, but lack the fine-grained dynamical effects that govern realistic mechanical behavior, such as frictional holding, detents, soft closing, and snap latching. Existing approaches either ignore the detailed structure of dynamics entirely, or use simple models with limited expressiveness. We introduce JODA, a framework for generating joint-level dynamics as a structured three-channel field over the joint degree of freedom, capturing conservative forces, dry friction, and damping. Instantiated using shape-constrained piecewise cubic interpolation (PCHIP), this formulation defines a compact and expressive function space that is both interpretable and compatible with differentiable simulation. Building on this representation, we develop methods for inferring and refining joint…
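
The three-channel idea in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the names `Channel` and `joint_torque`, the knot values for a hypothetical door hinge, the way the three channels are summed into a torque, and the tanh smoothing of the friction sign. The interpolant is a shape-preserving piecewise cubic Hermite curve with harmonic-mean interior slopes and simplified one-sided end slopes.

```python
import math
from bisect import bisect_right


def pchip_slopes(xs, ys):
    """Knot derivatives for a shape-preserving cubic Hermite interpolant."""
    n = len(xs)
    h = [xs[i + 1] - xs[i] for i in range(n - 1)]
    delta = [(ys[i + 1] - ys[i]) / h[i] for i in range(n - 1)]
    m = [0.0] * n
    for i in range(1, n - 1):
        # Weighted harmonic mean when neighbouring secants share a sign,
        # zero slope otherwise, so no overshoot appears at local extrema.
        if delta[i - 1] * delta[i] > 0:
            w1 = 2 * h[i] + h[i - 1]
            w2 = h[i] + 2 * h[i - 1]
            m[i] = (w1 + w2) / (w1 / delta[i - 1] + w2 / delta[i])
    # Simplified end condition (an assumption): one-sided secant slopes.
    m[0], m[-1] = delta[0], delta[-1]
    return m


class Channel:
    """One scalar channel defined over the joint coordinate q (hypothetical name)."""

    def __init__(self, knots, values):
        self.xs, self.ys = list(knots), list(values)
        self.m = pchip_slopes(self.xs, self.ys)

    def __call__(self, q):
        xs, ys, m = self.xs, self.ys, self.m
        q = min(max(q, xs[0]), xs[-1])  # clamp to the joint range
        i = min(bisect_right(xs, q) - 1, len(xs) - 2)
        h = xs[i + 1] - xs[i]
        t = (q - xs[i]) / h
        # Cubic Hermite basis functions on the unit interval.
        h00 = (1 + 2 * t) * (1 - t) ** 2
        h10 = t * (1 - t) ** 2
        h01 = t * t * (3 - 2 * t)
        h11 = t * t * (t - 1)
        return h00 * ys[i] + h10 * h * m[i] + h01 * ys[i + 1] + h11 * h * m[i + 1]


def joint_torque(q, qdot, conservative, friction, damping, eps=1e-3):
    """Assumed composition: conservative term, smoothed dry friction, damping."""
    return conservative(q) - friction(q) * math.tanh(qdot / eps) - damping(q) * qdot


# Hypothetical door hinge over q in [0, 1.5] rad; all values are made up.
knots = [0.0, 0.2, 0.5, 1.5]
conservative = Channel(knots, [-2.0, -0.5, 0.0, 0.0])  # latch pulls shut near q = 0
friction = Channel(knots, [0.3, 0.3, 0.3, 0.3])        # constant holding friction
damping = Channel(knots, [2.0, 0.5, 0.1, 0.1])         # soft close near q = 0

print(joint_torque(1.0, 1.0, conservative, friction, damping))
```

Because each channel is a small set of knot values, the parameters stay readable (a larger damping value near `q = 0` reads as "soft close"), and the smooth friction term keeps the torque differentiable with respect to those values, which is the property a gradient-based refinement would rely on.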
