Gradient Policy on "CartPole" game and its' expansibility to F1Tenth Autonomous Vehicles

Mingwei Shi

arXiv:2103.08396·cs.RO·March 16, 2021

Open Access

TL;DR

This paper explores the use of policy gradient methods in the CartPole environment and investigates how these techniques can be transferred to control F1Tenth autonomous vehicles by analyzing the similarity in their dynamic models.

Contribution

It provides a mathematical and implementation framework for policy gradient in CartPole and demonstrates the potential for model transfer to autonomous vehicle control using bicycle models.

Findings

01. Policy gradient effectively estimates continuous actions in CartPole.

02. Similarity between CartPole and vehicle turning angles facilitates model transfer.

03. Potential for applying reinforcement learning to autonomous vehicle control.
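To make the policy gradient idea in the findings concrete, here is a minimal sketch of a REINFORCE-style gradient estimate. It is not taken from the paper. It assumes a linear softmax policy over the two discrete push-left/push-right actions of the common Gym CartPole environment and a four-dimensional state. The names `discounted_returns`, `softmax`, and `reinforce_gradient` are hypothetical. A continuous-action variant would replace the softmax with, for example, a Gaussian policy.

```python
import numpy as np


def discounted_returns(rewards, gamma=0.99):
    """Return G_t = sum_k gamma^k * r_{t+k} for every step t of one episode."""
    returns = np.zeros(len(rewards))
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    shifted = logits - np.max(logits)
    exp = np.exp(shifted)
    return exp / exp.sum()


def reinforce_gradient(theta, states, actions, rewards, gamma=0.99):
    """Monte Carlo estimate of sum_t G_t * grad_theta log pi(a_t | s_t).

    Assumption: theta has shape (n_actions, state_dim) and the policy is
    pi(. | s) = softmax(theta @ s). This is an illustrative choice only.
    """
    grad = np.zeros_like(theta, dtype=float)
    returns = discounted_returns(rewards, gamma)
    for s, a, g in zip(states, actions, returns):
        s = np.asarray(s, dtype=float)
        probs = softmax(theta @ s)
        # Gradient of log softmax w.r.t. the logits is one_hot(a) - probs.
        dlogits = -probs
        dlogits[a] += 1.0
        grad += g * np.outer(dlogits, s)
    return grad
```

A gradient ascent step would then be `theta += learning_rate * reinforce_gradient(...)`, where `learning_rate` is a tuning parameter chosen by the user.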

Abstract

Policy gradient is an effective way to estimate continuous actions in an environment. This paper explains the mathematical formulation and its code implementation. Finally, it compares the rotation angle of the pole in CartPole with the steering angle of an autonomous vehicle when turning, using the Bicycle Model, a simple kinematic model, to identify the similarity between the two models and so facilitate model transfer from CartPole to the F1Tenth autonomous vehicle.
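For readers unfamiliar with the Bicycle Model named in the abstract, the following is a minimal sketch of one forward-Euler step of the standard kinematic bicycle model, with the rear axle as the reference point. It is not the paper's code. The function name `bicycle_step`, the default wheelbase of 0.33 m, and the time step are illustrative assumptions rather than values from the paper.

```python
import math


def bicycle_step(x, y, yaw, v, steer, wheelbase=0.33, dt=0.01):
    """Advance the kinematic bicycle model by one forward-Euler step.

    x, y      : rear-axle position in metres
    yaw       : heading angle in radians
    v         : longitudinal speed in m/s
    steer     : front-wheel steering angle in radians
    wheelbase : axle-to-axle distance in metres (assumed placeholder value)
    dt        : integration step in seconds (assumed placeholder value)
    """
    x_next = x + v * math.cos(yaw) * dt
    y_next = y + v * math.sin(yaw) * dt
    # The yaw rate grows with speed and with the tangent of the steering angle.
    yaw_next = yaw + (v / wheelbase) * math.tan(steer) * dt
    return x_next, y_next, yaw_next
```

Here the steering angle drives the heading through `tan(steer)`, which is the quantity the abstract sets against the pole's rotation angle in CartPole.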

Taxonomy

Topics: Transportation and Mobility Innovations