Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning
Ugo Lecerf, Christelle Yemdji-Tchassi, Pietro Michiardi

TL;DR
This paper presents a hierarchical contingency planning approach for autonomous vehicles to adapt quickly to sudden changes in environment beliefs, ensuring safety in stochastic, partially observable settings.
Contribution
The paper introduces an end-to-end method for learning and integrating contingency plans with hierarchical planning for robust autonomous navigation.
Findings
Achieves robust, safe behavior in stochastic, partially observable environments.
Generalizes well to unseen environment dynamics.
Enhances autonomous vehicle safety through contingency planning.
Abstract
When learning to act in a stochastic, partially observable environment, an intelligent agent should be prepared to anticipate a change in its belief of the environment state, and be capable of adapting its actions on-the-fly to changing conditions. As humans, we are able to form contingency plans when learning a task with the explicit aim of being able to correct errors in the initial control, and hence prove useful if ever there is a sudden change in our perception of the environment which requires immediate corrective action. This is especially the case for autonomous vehicles (AVs) navigating real-world situations where safety is paramount, and a strong ability to react to a changing belief about the environment is truly needed. In this paper we explore an end-to-end approach, from training to execution, for learning robust contingency plans and combining them with a hierarchical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Logic, Reasoning, and Knowledge · Reinforcement Learning in Robotics
