Beyond Imitation: A Life-long Policy Learning Framework for Path Tracking Control of Autonomous Driving
C. Gong, C. Lu, Z. Li, Z. Liu, J. Gong, X. Chen

TL;DR
This paper introduces a lifelong policy learning framework for autonomous vehicle path tracking that improves control policies over time using incremental data, surpassing traditional imitation learning methods.
Contribution
The paper proposes a novel lifelong learning framework that enhances imitation learning-based control policies for autonomous driving through incremental updates and knowledge evaluation.
Findings
The LLPL framework improves path tracking accuracy over time.
It achieves better control smoothness compared to baseline methods.
It is effective with noisy real-world driving data.
Abstract
Model-free learning-based control methods have recently shown significant advantages over traditional control methods in avoiding complex vehicle characteristic estimation and parameter tuning. As a primary policy learning method, imitation learning (IL) is capable of learning control policies directly from expert demonstrations. However, the performance of IL policies is highly dependent on the data sufficiency and quality of the demonstrations. To alleviate the above problems of IL-based policies, a lifelong policy learning (LLPL) framework is proposed in this paper, which extends the IL scheme with lifelong learning (LLL). First, a novel IL-based model-free control policy learning method for path tracking is introduced. Even with imperfect demonstration, the optimal control policy can be learned directly from historical driving data. Second, by using the LLL method, the pre-trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
