Semi-Supervised Imitation Learning with Mixed Qualities of Demonstrations for Autonomous Driving
Gunmin Lee, Wooseok Oh, Seungyoun Shin, Dohyeong Kim, Jeongwoo Oh,, Jaeyeon Jeong, Sungjoon Choi, Songhwai Oh

TL;DR
This paper introduces SSIL, a semi-supervised imitation learning method that assesses and leverages the quality of unlabeled demonstrations to improve autonomous driving performance, even with mixed demonstration qualities.
Contribution
The paper proposes a novel approach to evaluate and incorporate unlabeled demonstrations of varying quality into imitation learning for autonomous driving.
Findings
Effective discrimination of demonstration quality improves learning.
Method performs well with mixed-quality unlabeled data.
Successful real-world application on RC car hardware.
Abstract
In this paper, we consider the problem of autonomous driving using imitation learning in a semi-supervised manner. In particular, both labeled and unlabeled demonstrations are leveraged during training by estimating the quality of each unlabeled demonstration. If the provided demonstrations are corrupted and have a low signal-to-noise ratio, the performance of the imitation learning agent can be degraded significantly. To mitigate this problem, we propose a method called semi-supervised imitation learning (SSIL). SSIL first learns how to discriminate and evaluate each state-action pair's reliability in unlabeled demonstrations by assigning higher reliability values to demonstrations similar to labeled expert demonstrations. This reliability value is called leverage. After this discrimination process, both labeled and unlabeled demonstrations with estimated leverage values are utilized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Machine Learning and Data Classification
