Beyond the Majority: Long-tail Imitation Learning for Robotic Manipulation
Junhong Zhu, Ji Zhang, Jingkuan Song, Lianli Gao, Heng Tao Shen

TL;DR
This paper addresses the long-tail distribution challenge in imitation learning for robotic manipulation, proposing a novel transfer scheme called Approaching-Phase Augmentation (APA) that improves tail task performance without external data.
Contribution
The paper introduces APA, a new method that transfers knowledge from head to tail tasks, overcoming data scarcity and improving generalization in long-tail imitation learning.
Findings
APA significantly improves tail task performance in simulation and real-world tests.
Conventional re-sampling strategies are ineffective for long-tail policy learning.
Data scarcity impairs spatial reasoning, which APA effectively mitigates.
Abstract
While generalist robot policies hold significant promise for learning diverse manipulation skills through imitation, their performance is often hindered by the long-tail distribution of training demonstrations. Policies learned on such data, which is heavily skewed towards a few data-rich head tasks, frequently exhibit poor generalization when confronted with the vast number of data-scarce tail tasks. In this work, we conduct a comprehensive analysis of the pervasive long-tail challenge inherent in policy learning. Our analysis begins by demonstrating the inefficacy of conventional long-tail learning strategies (e.g., re-sampling) for improving the policy's performance on tail tasks. We then uncover the underlying mechanism for this failure, revealing that data scarcity on tail tasks directly impairs the policy's spatial reasoning capability. To overcome this, we introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Domain Adaptation and Few-Shot Learning
