Learning Object Arrangements in 3D Scenes using Human Context

Yun Jiang (Cornell University); Marcus Lim (Cornell University),; Ashutosh Saxena (Cornell University)

arXiv:1206.6462·cs.LG·July 3, 2012·ICML·57 cites

Learning Object Arrangements in 3D Scenes using Human Context

Yun Jiang (Cornell University), Marcus Lim (Cornell University),, Ashutosh Saxena (Cornell University)

PDF

Open Access

TL;DR

This paper introduces a method for arranging objects in 3D scenes by modeling their relationships to human poses, improving scalability and accuracy over traditional object-object models.

Contribution

It proposes a novel approach that models human-object relationships using density functions and a Dirichlet process mixture model for better scene arrangement predictions.

Findings

01

Achieved an average placement error of 1.6 meters in 20 rooms.

02

Scored 4.3/5 on real scene arrangements, outperforming baseline methods.

03

Scalable modeling of human-object relationships improves scene understanding.

Abstract

We consider the problem of learning object arrangements in a 3D scene. The key idea here is to learn how objects relate to human poses based on their affordances, ease of use and reachability. In contrast to modeling object-object relationships, modeling human-object relationships scales linearly in the number of objects. We design appropriate density functions based on 3D spatial features to capture this. We learn the distribution of human poses in a scene using a variant of the Dirichlet process mixture model that allows sharing of the density function parameters across the same object types. Then we can reason about arrangements of the objects in the room based on these meaningful human poses. In our extensive experiments on 20 different rooms with a total of 47 objects, our algorithm predicted correct placements with an average error of 1.6 meters from ground truth. In arranging…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Image Processing and 3D Reconstruction