Human-in-the-Loop Robot Planning with Non-Contextual Bandit Feedback

Yijie Zhou; Yan Zhang; Xusheng Luo; Michael M. Zavlanos

arXiv:2011.01793·cs.RO·November 4, 2020·1 cites

Human-in-the-Loop Robot Planning with Non-Contextual Bandit Feedback

Yijie Zhou, Yan Zhang, Xusheng Luo, Michael M. Zavlanos

PDF

Open Access

TL;DR

This paper introduces a semi-supervised Bayesian Optimization approach for robot trajectory planning in human-populated environments, effectively using non-contextual human feedback to optimize satisfaction while ensuring safety and feasibility.

Contribution

It proposes a novel combination of autoencoder-based dimensionality reduction and biased Bayesian Optimization to efficiently plan human-aware robot trajectories with minimal human feedback.

Findings

01

Efficiently finds collision-free, human-satisfactory trajectories

02

Reduces high-dimensional planning to low-dimensional latent space

03

Demonstrates effectiveness in diverse human scenarios

Abstract

In this paper, we consider a robot navigation problem in environments populated by humans. The goal is to determine collision-free and dynamically feasible trajectories that also maximize human satisfaction. This is because they may drive the robot close to humans that need help with their work or because they may keep the robot away from humans when it can interfere with human sight or work. In practice, human satisfaction is subjective and hard to describe mathematically. As a result, the planning problem we consider in this paper may lack important contextual information. To address this challenge, we propose a semi-supervised Bayesian Optimization (BO) method to design globally optimal robot trajectories using non-contextual bandit human feedback in the form of complaints or satisfaction ratings that express how satisfactory a trajectory is, without revealing the reason. Since…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics