Theoretical Analysis of Offline Imitation With Supplementary Dataset

Ziniu Li; Tian Xu; Yang Yu; Zhi-Quan Luo

arXiv:2301.11687·cs.LG·January 30, 2023

Theoretical Analysis of Offline Imitation With Supplementary Dataset

Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo

PDF

Open Access 1 Repo

TL;DR

This paper provides a theoretical foundation for offline imitation learning with supplementary datasets, introducing new methods that outperform traditional behavioral cloning in certain scenarios.

Contribution

It develops a theoretical analysis of NBCU and WBCU methods, showing conditions under which supplementary data can improve imitation learning performance.

Findings

01

NBCU can outperform or match BC in special cases despite larger imitation gaps.

02

WBCU, with importance sampling, can outperform BC under mild conditions.

03

Empirical results demonstrate WBCU's superior performance on challenging tasks.

Abstract

Behavioral cloning (BC) can recover a good policy from abundant expert data, but may fail when expert data is insufficient. This paper considers a situation where, besides the small amount of expert data, a supplementary dataset is available, which can be collected cheaply from sub-optimal policies. Imitation learning with a supplementary dataset is an emergent practical framework, but its theoretical foundation remains under-developed. To advance understanding, we first investigate a direct extension of BC, called NBCU, that learns from the union of all available data. Our analysis shows that, although NBCU suffers an imitation gap that is larger than BC in the worst case, there exist special cases where NBCU performs better than or equally well as BC. This discovery implies that noisy data can also be helpful if utilized elaborately. Therefore, we further introduce a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liziniu/ilwsd
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms

Methodsfail