Commonsense Scene Semantics for Cognitive Robotics: Towards Grounding   Embodied Visuo-Locomotive Interactions

Jakob Suchan; Mehul Bhatt

arXiv:1709.05293·cs.RO·September 18, 2017

Commonsense Scene Semantics for Cognitive Robotics: Towards Grounding Embodied Visuo-Locomotive Interactions

Jakob Suchan, Mehul Bhatt

PDF

TL;DR

This paper introduces a comprehensive model that combines visual processing and human-centered spatial reasoning to improve understanding of robot interactions with environments.

Contribution

It presents an integrative methodology for grounding visuo-spatial and locomotive interactions in robotics using AI-based semantic models.

Findings

01

Effective semantic grounding demonstrated in object interactions

02

Successful indoor movement understanding

03

Framework bridges low-level vision and high-level spatial reasoning

Abstract

We present a commonsense, qualitative model for the semantic grounding of embodied visuo-spatial and locomotive interactions. The key contribution is an integrative methodology combining low-level visual processing with high-level, human-centred representations of space and motion rooted in artificial intelligence. We demonstrate practical applicability with examples involving object interactions, and indoor movement.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.