GABRIL: Gaze-Based Regularization for Mitigating Causal Confusion in Imitation Learning
Amin Banayeeanzade, Fatemeh Bahrani, Yutai Zhou, Erdem B{\i}y{\i}k

TL;DR
GABRIL introduces gaze-based regularization to improve imitation learning by reducing causal confusion, leading to significantly better performance and enhanced explainability in Atari and CARLA benchmarks.
Contribution
This paper presents a novel gaze-based regularization method for imitation learning that mitigates causal confusion by leveraging human gaze data during training.
Findings
GABRIL outperforms baseline methods with 179% improvement in Atari environments.
GABRIL achieves 76% better performance in CARLA benchmarks.
The method enhances explainability of learned policies.
Abstract
Imitation Learning (IL) is a widely adopted approach which enables agents to learn from human expert demonstrations by framing the task as a supervised learning problem. However, IL often suffers from causal confusion, where agents misinterpret spurious correlations as causal relationships, leading to poor performance in testing environments with distribution shift. To address this issue, we introduce GAze-Based Regularization in Imitation Learning (GABRIL), a novel method that leverages the human gaze data gathered during the data collection phase to guide the representation learning in IL. GABRIL utilizes a regularization loss which encourages the model to focus on causally relevant features identified through expert gaze and consequently mitigates the effects of confounding variables. We validate our approach in Atari environments and the Bench2Drive benchmark in CARLA by collecting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
