Behavior-Aware Anthropometric Scene Generation for Human-Usable 3D Layouts
Semin Jin, Donghyuk Kim, Jeongmin Ryu, Kyung Hoon Hyun

TL;DR
This paper introduces a behavior-aware 3D scene generation framework that incorporates anthropometric data and VLM analysis to create indoor layouts optimized for human activity and usability.
Contribution
It presents a novel method combining vision-language models with anthropometric constraints to generate human-centric indoor scenes, validated through technical and human studies.
Findings
Improved task completion times in generated scenes
Enhanced trajectory efficiency for users
Better human-object manipulation space
Abstract
Well-designed indoor scenes should prioritize how people can act within a space rather than merely what objects to place. However, existing 3D scene generation methods emphasize visual and semantic plausibility, while insufficiently addressing whether people can comfortably walk, sit, or manipulate objects. To bridge this gap, we present a Behavior-Aware Anthropometric Scene Generation framework. Our approach leverages vision-language models (VLMs) to analyze object-behavior relationships, translating spatial requirements into parametric layout constraints adapted to user-specific anthropometric data. We conducted comparative studies with state-of-the-art models using geometric metrics and a user perception study (N=16). We further conducted in-depth human-scale studies (individuals, N=20; groups, N=18). The results showed improvements in task completion time, trajectory efficiency, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Human Motion and Animation · Human Pose and Action Recognition
