SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling

Elisabetta Fedele; Francis Engelmann; Ian Huang; Or Litany; Marc Pollefeys; Leonidas Guibas

arXiv:2512.05343·cs.CV·March 16, 2026

SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling

Elisabetta Fedele, Francis Engelmann, Ian Huang, Or Litany, Marc Pollefeys, Leonidas Guibas

PDF

Open Access 3 Reviews

TL;DR

SpaceControl is a training-free, test-time method that provides intuitive spatial control over 3D generative models, allowing precise manipulation of object geometry without additional training.

Contribution

It introduces a novel test-time approach for explicit spatial control in 3D generation that accepts diverse geometric inputs and integrates seamlessly with existing models.

Findings

01

Outperforms baselines in geometric faithfulness

02

Maintains high visual quality

03

Enables real-time interactive editing

Abstract

Generative methods for 3D assets have recently achieved remarkable progress, yet providing intuitive and precise control over the object geometry remains a key challenge. Existing approaches predominantly rely on text or image prompts, which often fall short in geometric specificity: language can be ambiguous, and images are difficult to manipulate. In this work, we introduce SpaceControl, a training-free test-time method for explicit spatial control of 3D asset generation. Our approach accepts a wide range of geometric inputs, from coarse primitives to detailed meshes, and integrates seamlessly with modern generative models without requiring any additional training. A control parameter lets users trade off between geometric fidelity and output realism. Extensive quantitative evaluation and user studies demonstrate that SpaceControl outperforms both training-based and optimization-based…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 6Confidence 4

Strengths

1: The proposed SPACECONTROL is a training-free method that injects explicit spatial control into a powerful pre-trained 3D generative model (Trellis) purely at test-time, which is efficient. 2: The framework provides intuitive geometric control using diverse inputs, from coarse primitives to detailed meshes, and introduces a single parameter that allows users to flexibly trade off between geometric faithfulness and output realism. 3: Extensive experiments show the method significantly outperf

Weaknesses

1: **Reliance on Manual Parameter Tuning.** The crucial trade-off between realism and faithfulness is governed by the $\tau_0$ parameter, which the paper states must be "selected manually"2. While Table 2 and Figure 3 show the effect of varying $\tau_0$, the optimal value appears to be object-dependent. This manual, per-instance tuning requirement undermines the method's practicality and hinders its use in automated generation pipelines. 2: **Lack of Part-Based Control.** The parameter $\tau_0$

Reviewer 02Rating 6Confidence 4

Strengths

**Training-free controllability.** The whole system is designed without model training or surgery, which preserves the strong 3D generative prior in Trellis [1], and the approach introduced seems to be also applicable to other / future 3D generation under the Trellis framework. **Good controllability.** With superquadrics, this method enables explicit control with only a few primitives, which is user-friendly and accurate on most shapes. It also introduces a tunable adherence parameter that ena

Weaknesses

**Implied control limitation on curvy shapes.** The 3D geometry representation, superquadrics, is often suitable for objects primarily containing convex shapes - cubes, spheres, cylinders, etc., while for concave shapes, it requires multiple superquadrics to compose. In pure 3D reconstruction domain, this does not impose a problem, but in terms of controllability, where simplicity or the number of the underlying primitives is more critical, superquadrics may suffer. **Limited conceptual novelty

Reviewer 03Rating 4Confidence 4

Strengths

1. The motivation is clear and practical. Allowing users to directly manipulate geometry using 3D primitives, aligns well with real-world design workflows. 2. The paper is well-structured, with a logical flow and clear figures. 3. The performance is good and includes a user study and interactive interface to demonstrate practical utility.

Weaknesses

1. Limited technical novelty. The core idea of injecting geometric guidance before the appearance generation stage appears to be a natural extension of Trellis’s inherent two-stage design (coarse geometry -> fine appearance). This raises the question of whether the controllability stems more from Trellis itself than from a genuinely novel contribution. 2. Potential generalizability problem beyond two-stage architectures. The method is tightly coupled to Trellis’s specific disentangled geometry-

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques