Hierarchical Imitation Learning for Stochastic Environments
Maximilian Igl, Punit Shah, Paul Mougin, Sirish Srinivasan, Tarun, Gupta, Brandyn White, Kyriacos Shiarlis, Shimon Whiteson

TL;DR
This paper introduces Robust Type Conditioning (RTC), a novel adversarial training method for hierarchical imitation learning that effectively disentangles internal and external factors, improving behavior diversity and realism in stochastic environments like autonomous driving.
Contribution
The paper proposes RTC, a new adversarial training approach that addresses distribution shift in hierarchical imitation learning for stochastic environments, ensuring more realistic agent behaviors.
Findings
RTC improves distributional realism in behavior modeling.
RTC maintains or enhances task performance.
Experiments on Waymo dataset validate effectiveness.
Abstract
Many applications of imitation learning require the agent to generate the full distribution of behaviour observed in the training data. For example, to evaluate the safety of autonomous vehicles in simulation, accurate and diverse behaviour models of other road users are paramount. Existing methods that improve this distributional realism typically rely on hierarchical policies. These condition the policy on types such as goals or personas that give rise to multi-modal behaviour. However, such methods are often inappropriate for stochastic environments where the agent must also react to external factors: because agent types are inferred from the observed future trajectory during training, these environments require that the contributions of internal and external factors to the agent behaviour are disentangled and only internal factors, i.e., those under the agent's control, are encoded…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Human Pose and Action Recognition · Explainable Artificial Intelligence (XAI)
