Spatial Commonsense Graph for Object Localisation in Partial Scenes

Francesco Giuliari; Geri Skenderi; Marco Cristani; Yiming; Wang; Alessio Del Bue

arXiv:2203.05380·cs.CV·March 15, 2022

Spatial Commonsense Graph for Object Localisation in Partial Scenes

Francesco Giuliari, Geri Skenderi, Marco Cristani, Yiming, Wang, Alessio Del Bue

PDF

1 Repo

TL;DR

This paper introduces the Spatial Commonsense Graph, a novel scene graph model that leverages commonsense knowledge and graph neural networks to accurately localize objects in partial 3D scenes, outperforming existing methods.

Contribution

The paper proposes the Spatial Commonsense Graph and a two-step localization approach, combining a graph neural network with a circular intersection method, for improved object localization in partial scenes.

Findings

01

Achieves superior localization accuracy on a new partial scene dataset.

02

Demonstrates effective generalization to unseen 3D scenes.

03

Outperforms baseline methods in object localization tasks.

Abstract

We solve object localisation in partial scenes, a new problem of estimating the unknown position of an object (e.g. where is the bag?) given a partial 3D scan of a scene. The proposed solution is based on a novel scene graph model, the Spatial Commonsense Graph (SCG), where objects are the nodes and edges define pairwise distances between them, enriched by concept nodes and relationships from a commonsense knowledge base. This allows SCG to better generalise its spatial inference over unknown 3D scenes. The SCG is used to estimate the unknown position of the target object in two steps: first, we feed the SCG into a novel Proximity Prediction Network, a graph neural network that uses attention to perform distance prediction between the node representing the target object and the nodes representing the observed objects in the SCG; second, we propose a Localisation Module based on circular…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fgiuliari/spatialcommonsensegraph-dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGraph Neural Network