Balance Equation-based Distributionally Robust Offline Imitation Learning

Rishabh Agrawal; Yusuf Alvi; Rahul Jain; Ashutosh Nayyar

arXiv:2511.07942·cs.LG·November 12, 2025

Balance Equation-based Distributionally Robust Offline Imitation Learning

Rishabh Agrawal, Yusuf Alvi, Rahul Jain, Ashutosh Nayyar

PDF

Open Access

TL;DR

This paper introduces a novel distributionally robust offline imitation learning framework that learns policies resilient to environment dynamics shifts using only expert demonstrations, without additional environment interaction.

Contribution

It formulates a robust optimization approach based on balance equations that is fully offline and data-driven, improving robustness against transition model uncertainties.

Findings

01

Outperforms existing offline IL methods under environment shifts

02

Achieves higher robustness and generalization in continuous control tasks

03

Enables offline learning without environment interaction

Abstract

Imitation Learning (IL) has proven highly effective for robotic and control tasks where manually designing reward functions or explicit controllers is infeasible. However, standard IL methods implicitly assume that the environment dynamics remain fixed between training and deployment. In practice, this assumption rarely holds where modeling inaccuracies, real-world parameter variations, and adversarial perturbations can all induce shifts in transition dynamics, leading to severe performance degradation. We address this challenge through Balance Equation-based Distributionally Robust Offline Imitation Learning, a framework that learns robust policies solely from expert demonstrations collected under nominal dynamics, without requiring further environment interaction. We formulate the problem as a distributionally robust optimization over an uncertainty set of transition models, seeking a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Adversarial Robustness in Machine Learning