TL;DR
CycleRL introduces a robust sim-to-real deep reinforcement learning framework for autonomous bicycle control, achieving high success rates and accurate tracking in simulation and real-world deployment.
Contribution
The paper presents a novel sim-to-real framework using DRL with domain randomization for autonomous bicycles, enabling direct transfer from simulation to real-world operation.
Findings
99.90% balance success rate in simulation
Heading tracking error of 1.15 degrees
Velocity tracking error of 0.18 m/s
Abstract
Autonomous bicycles offer a promising agile solution for urban mobility and last-mile logistics. However, conventional control strategies often struggle with underactuated nonlinear dynamics, suffering from sensitivity to model mismatches and limited adaptability to real-world uncertainties. To address this, we develop CycleRL, a comprehensive sim-to-real framework for robust autonomous bicycle control. Our approach establishes a direct perception-to-action mapping within the high-fidelity NVIDIA Isaac Sim environment, leveraging Proximal Policy Optimization (PPO) to optimize the control policy. The framework features a composite reward function tailored for concurrent balance maintenance, velocity tracking, and steering control. Crucially, systematic domain randomization is employed to reduce the reliance on precise system modeling, bridge the simulation-to-reality gap and facilitate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
