Towards Balanced Behavior Cloning from Imbalanced Datasets
Sagar Parekh, Heramb Nemlekar, and Dylan P. Losey

TL;DR
This paper addresses the challenge of imbalanced datasets in robot imitation learning by analyzing the impact of data imbalance, proposing rebalancing algorithms, and introducing a meta-gradient approach to improve policy performance.
Contribution
The paper formally proves the effects of data imbalance on learned policies and introduces a novel meta-gradient rebalancing algorithm for better imitation learning.
Findings
Rebalancing datasets improves downstream policy performance.
Imbalanced data causes policies to favor the most frequent behaviors.
Meta-gradient rebalancing outperforms existing methods.
Abstract
Robots should be able to learn complex behaviors from human demonstrations. In practice, these human-provided datasets are inevitably imbalanced: i.e., the human demonstrates some subtasks more frequently than others. State-of-the-art methods default to treating each element of the human's dataset as equally important. So if -- for instance -- the majority of the human's data focuses on reaching a goal, and only a few state-action pairs move to avoid an obstacle, the learning algorithm will place greater emphasis on goal reaching. More generally, misalignment between the relative amounts of data and the importance of that data causes fundamental problems for imitation learning approaches. In this paper we analyze and develop learning methods that automatically account for mixed datasets. We formally prove that imbalanced data leads to imbalanced policies when each state-action pair is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Internet Traffic Analysis and Secure E-voting · Imbalanced Data Classification Techniques
