DepthScape: Authoring 2.5D Designs via Depth Estimation, Semantic Understanding, and Geometry Extraction

Xia Su; Cuong Nguyen; Matheus A. Gadelha; Jon E. Froehlich

arXiv:2512.02263·cs.HC·December 3, 2025

DepthScape: Authoring 2.5D Designs via Depth Estimation, Semantic Understanding, and Geometry Extraction

Xia Su, Cuong Nguyen, Matheus A. Gadelha, Jon E. Froehlich

PDF

Open Access

TL;DR

DepthScape is a collaborative system that simplifies creating 2.5D visual effects by integrating depth estimation, semantic understanding, and geometry extraction to enable intuitive editing of images with realistic 3D effects.

Contribution

We introduce DepthScape, a novel system combining monocular depth reconstruction and vision-language analysis for easy 2.5D design creation.

Findings

01

Effective 3D placement of design elements achieved

02

System validated with diverse users and professional images

03

Expert evaluation confirms high-quality results

Abstract

2.5D effects, such as occlusion and perspective foreshortening, enhance visual dynamics and realism by incorporating 3D depth cues into 2D designs. However, creating such effects remains challenging and labor-intensive due to the complexity of depth perception. We introduce DepthScape, a human-AI collaborative system that facilitates 2.5D effect creation by directly placing design elements into 3D reconstructions. Using monocular depth reconstruction, DepthScape transforms images into 3D reconstructions where visual contents are placed to automatically achieve realistic occlusion and perspective foreshortening. To further simplify 3D placement through a 2D viewport, DepthScape uses a vision-language model to analyze source images and extract key visual components as content anchors for direct manipulation editing. We evaluate DepthScape with nine participants of varying design…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInteractive and Immersive Displays · 3D Shape Modeling and Analysis · Advanced Vision and Imaging