Risk Averse Bayesian Reward Learning for Autonomous Navigation from   Human Demonstration

Christian Ellis; Maggie Wigness; John G. Rogers III; Craig Lennon,; Lance Fiondella

arXiv:2108.00276·cs.RO·August 3, 2021

Risk Averse Bayesian Reward Learning for Autonomous Navigation from Human Demonstration

Christian Ellis, Maggie Wigness, John G. Rogers III, Craig Lennon,, Lance Fiondella

PDF

Open Access

TL;DR

This paper introduces a Bayesian method for reward learning in autonomous navigation that accounts for uncertainty, enabling safer planning in dynamic environments with minimal human demonstrations.

Contribution

It presents a novel Bayesian approach to quantify and incorporate uncertainty in reward weights, improving safety and robustness in robot navigation from limited demonstrations.

Findings

01

Successfully avoided dangerous terrain in 2 out of 3 scenarios

02

Reduced overall risk compared to existing methods

03

Operated safely without additional demonstrations

Abstract

Traditional imitation learning provides a set of methods and algorithms to learn a reward function or policy from expert demonstrations. Learning from demonstration has been shown to be advantageous for navigation tasks as it allows for machine learning non-experts to quickly provide information needed to learn complex traversal behaviors. However, a minimal set of demonstrations is unlikely to capture all relevant information needed to achieve the desired behavior in every possible future operational environment. Due to distributional shift among environments, a robot may encounter features that were rarely or never observed during training for which the appropriate reward value is uncertain, leading to undesired outcomes. This paper proposes a Bayesian technique which quantifies uncertainty over the weights of a linear reward function given a dataset of minimal human demonstrations to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning