TL;DR
PhysX-Omni is a unified framework for generating and understanding diverse simulation-ready 3D assets, advancing applications in embodied AI and physics-based simulation.
Contribution
It introduces a novel geometry representation, a comprehensive dataset, and a versatile benchmark for 3D asset generation and understanding across multiple categories.
Findings
PhysX-Omni outperforms existing methods in both generation and understanding tasks.
The PhysXVerse dataset covers diverse indoor and outdoor categories.
PhysX-Bench effectively evaluates key attributes of 3D assets.
Abstract
Simulation-ready physical 3D assets have emerged as a promising direction owing to their broad applicability in downstream tasks. However, most existing 3D generation methods either neglect physical properties or are limited to a single asset category, e.g., rigid, deformable, or articulated objects. To address these limitations, we introduce PhysX-Omni, a unified framework for simulation-ready physical 3D generation across diverse asset types. Specifically, we develop a novel and efficient geometry representation tailored for Vision-Language Models, which directly encodes high-resolution 3D structures without compression, significantly improving generation performance. In addition, we construct the first general simulation-ready 3D dataset, PhysXVerse, covering diverse indoor and outdoor categories. Furthermore, to comprehensively and flexibly evaluate both generative and understanding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
