Controllable 3D Placement of Objects with Scene-Aware Diffusion Models

Mohamed Omran; Dimitris Kalatzis; Jens Petersen; Amirhossein Habibian; Auke Wiggers

arXiv:2506.21446·cs.CV·June 27, 2025

Controllable 3D Placement of Objects with Scene-Aware Diffusion Models

Mohamed Omran, Dimitris Kalatzis, Jens Petersen, Amirhossein Habibian, Auke Wiggers

PDF

Open Access

TL;DR

This paper introduces a scene-aware diffusion model approach for precise 3D object placement in images, enabling flexible control over position, orientation, and shape while preserving background integrity.

Contribution

The authors propose a novel conditioning method using visual maps and coarse masks for high-quality, controllable object placement that maintains background details.

Findings

01

Effective in automotive scene editing

02

Accurate pose and location control demonstrated

03

Compatible with appearance adjustments

Abstract

Image editing approaches have become more powerful and flexible with the advent of powerful text-conditioned generative models. However, placing objects in an environment with a precise location and orientation still remains a challenge, as this typically requires carefully crafted inpainting masks or prompts. In this work, we show that a carefully designed visual map, combined with coarse object masks, is sufficient for high quality object placement. We design a conditioning signal that resolves ambiguities, while being flexible enough to allow for changing of shapes or object orientations. By building on an inpainting model, we leave the background intact by design, in contrast to methods that model objects and background jointly. We demonstrate the effectiveness of our method in the automotive setting, where we compare different conditioning signals in novel object placement tasks.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage

MethodsInpainting