Beyond Spatial Compression: Interface-Centric Generative States for Open-World 3D Structure

Xiang Chen; Alexander Binder

arXiv:2605.10438·cs.LG·May 12, 2026

Beyond Spatial Compression: Interface-Centric Generative States for Open-World 3D Structure

Xiang Chen, Alexander Binder

PDF

TL;DR

This paper introduces interface-centric generative states for 3D models, enabling explicit control and repair of component relationships during decoding, improving robustness in open-world assets.

Contribution

It proposes a novel tokenization approach, C2LT-3D, that exposes local geometry and attachment variables for better structural reasoning and repair.

Findings

01

C2LT-3D improves structural robustness in open-world 3D assets.

02

Latent variables remain actionable under adversarial attachment settings.

03

Supports attachment validation and structural repair during decoding.

Abstract

Current 3D tokenizers largely treat representation as spatial compression: compact codes reconstruct surface geometry, but leave component ownership and attachment validity implicit. In open-world assets with intersecting components, noisy topology, and weak canonical structure, this creates a representation mismatch: local shape, component identity, and assembly relations become entangled in a latent stream and are not natively addressable during decoding. We formulate an alternative view, interface-centric generative states, in which tokenization constructs an operational state rather than a passive compressed code. The state exposes local geometry, component ownership, and attachment validity as variables that can be queried, constrained, and repaired during decoding. We instantiate this formulation with Component-Conditioned Canonical Local Tokens (C2LT-3D), factorizing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.