Risk Averse Bayesian Reward Learning for Autonomous Navigation from Human Demonstration
Christian Ellis, Maggie Wigness, John G. Rogers III, Craig Lennon,, Lance Fiondella

TL;DR
This paper introduces a Bayesian method for reward learning in autonomous navigation that accounts for uncertainty, enabling safer planning in dynamic environments with minimal human demonstrations.
Contribution
It presents a novel Bayesian approach to quantify and incorporate uncertainty in reward weights, improving safety and robustness in robot navigation from limited demonstrations.
Findings
Successfully avoided dangerous terrain in 2 out of 3 scenarios
Reduced overall risk compared to existing methods
Operated safely without additional demonstrations
Abstract
Traditional imitation learning provides a set of methods and algorithms to learn a reward function or policy from expert demonstrations. Learning from demonstration has been shown to be advantageous for navigation tasks as it allows for machine learning non-experts to quickly provide information needed to learn complex traversal behaviors. However, a minimal set of demonstrations is unlikely to capture all relevant information needed to achieve the desired behavior in every possible future operational environment. Due to distributional shift among environments, a robot may encounter features that were rarely or never observed during training for which the appropriate reward value is uncertain, leading to undesired outcomes. This paper proposes a Bayesian technique which quantifies uncertainty over the weights of a linear reward function given a dataset of minimal human demonstrations to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning
