Situated Multimodal Control of a Mobile Robot: Navigation through a Virtual Environment

Katherine Krajovic, Nikhil Krishnaswamy, Nathaniel J. Dimick, R. Pito Salas, and James Pustejovsky

arXiv:2007.09053 · cs.RO · July 20, 2020 · 1 citation

Open Access

TL;DR

This paper introduces a multimodal control interface combining gesture and language for navigating a robot in new environments, utilizing embodied simulation and cross-platform communication.

Contribution

It presents a novel integrated system enabling natural human-robot interaction through coordinated gestures and spoken commands for navigation tasks.

Findings

01 Effective control of robot navigation via multimodal input

02 Successful integration of embodied simulation for environment understanding

03 Enhanced human-robot communication in navigation scenarios

Abstract

We present a new interface for controlling a navigation robot in novel environments using coordinated gesture and language. We use a TurtleBot3 robot with a LIDAR and a camera, an embodied simulation of what the robot has encountered while exploring, and a cross-platform bridge facilitating generic communication. A human partner can deliver instructions to the robot using spoken English and gestures relative to the simulated environment, to guide the robot through navigation tasks.
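
The abstract describes spoken English and gestures being combined into instructions that cross a bridge to the robot. The sketch below illustrates one way such fusion could look; it is not the authors' implementation. The names `Gesture`, `fuse`, and `to_bridge_message`, the keyword matching, and the JSON message layout are all hypothetical, chosen only to make the idea concrete.

```python
import json
import re
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Gesture:
    """Hypothetical pointing gesture, already resolved to a map location in metres."""
    target_xy: Tuple[float, float]


def fuse(utterance: str, gesture: Optional[Gesture]) -> dict:
    """Combine a spoken command with an optional pointing gesture into one goal."""
    # Lowercase and keep alphabetic words only, so "Go there." matches "go there".
    words = re.findall(r"[a-z]+", utterance.lower())
    if "stop" in words:
        return {"action": "stop"}
    if "go" in words and ("there" in words or "here" in words):
        if gesture is None:
            # A deictic word without a gesture is ambiguous, so request clarification.
            return {"action": "clarify", "reason": "missing pointing gesture"}
        x, y = gesture.target_xy
        return {"action": "navigate", "x": x, "y": y}
    return {"action": "clarify", "reason": "unrecognized command"}


def to_bridge_message(goal: dict) -> str:
    """Serialize the goal as JSON text (an assumed, platform-neutral wire format)."""
    return json.dumps(goal, sort_keys=True)


if __name__ == "__main__":
    goal = fuse("Go there.", Gesture(target_xy=(1.5, -0.5)))
    print(to_bridge_message(goal))
```

In this sketch the language supplies the action and the gesture supplies the location, so neither modality alone is enough to produce a navigation goal. A real system would replace the keyword matching with a proper speech and gesture recognition pipeline.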

Taxonomy

Topics: Multimodal Machine Learning Applications · Speech and dialogue systems · Social Robot Interaction and HRI