PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image

Ziang Cao; Fangzhou Hong; Zhaoxi Chen; Liang Pan; Ziwei Liu

arXiv:2511.13648·cs.CV·November 18, 2025

TL;DR

PhysX-Anything is a novel framework that generates high-quality, simulation-ready 3D assets with physical and articulation properties from a single image, advancing embodied AI and physics simulation.

Contribution

It introduces the first VLM-based physical 3D generative model with a new efficient 3D representation and a large, diverse dataset for physical 3D objects.

Findings

1. Produces high-quality, simulation-ready 3D assets from single images

2. Demonstrates strong generalization and performance in experiments

3. Enables direct use of assets for robotic policy learning in simulation

Abstract

3D modeling is shifting from static visual representations toward physical, articulated assets that can be directly used in simulation and interaction. However, most existing 3D generation methods overlook key physical and articulation properties, thereby limiting their utility in embodied AI. To bridge this gap, we introduce PhysX-Anything, the first simulation-ready physical 3D generative framework that, given a single in-the-wild image, produces high-quality sim-ready 3D assets with explicit geometry, articulation, and physical attributes. Specifically, we propose the first VLM-based physical 3D generative model, along with a new 3D representation that efficiently tokenizes geometry. This representation reduces the number of tokens by 193x, enabling explicit geometry learning within standard VLM token budgets without introducing any special tokens during fine-tuning and significantly improving…
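To give a feel for what a 193x token reduction means against a fixed context window, here is a minimal arithmetic sketch. Only the factor 193 comes from the abstract; the baseline token count (100,000) and the budget (8,192) are hypothetical numbers chosen for illustration, and the abstract does not state which baseline representation the reduction is measured against.

```python
# Illustrative arithmetic only. REDUCTION_FACTOR is the figure quoted in the
# abstract; every other number below is a hypothetical assumption.
REDUCTION_FACTOR = 193


def compressed_token_count(baseline_tokens: int,
                           reduction_factor: int = REDUCTION_FACTOR) -> int:
    """Return the token count after dividing by the reduction factor, rounded up."""
    if baseline_tokens < 0 or reduction_factor <= 0:
        raise ValueError("baseline_tokens must be >= 0 and reduction_factor > 0")
    # Integer ceiling division avoids floating-point rounding.
    return -(-baseline_tokens // reduction_factor)


def fits_budget(baseline_tokens: int, budget: int,
                reduction_factor: int = REDUCTION_FACTOR) -> bool:
    """Check whether the compressed sequence fits in a given token budget."""
    return compressed_token_count(baseline_tokens, reduction_factor) <= budget


if __name__ == "__main__":
    baseline = 100_000  # hypothetical uncompressed geometry token count
    budget = 8_192      # hypothetical VLM context budget
    print(compressed_token_count(baseline))                   # 519
    print(fits_budget(baseline, budget))                      # True
    print(fits_budget(baseline, budget, reduction_factor=1))  # False
```

Under these assumed numbers, a geometry sequence that would overflow the budget uncompressed fits with ample room once reduced, which is the practical point the abstract makes about staying within standard VLM token budgets.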

Code & Models

Models

Caoza/PhysX-Anything (model, 1 download, 8 likes)

Datasets

Caoza/PhysX-Mobility (dataset, 90 downloads)

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Human Motion and Animation