Reinforcement Learning for Ballbot Navigation in Uneven Terrain

Achkan Salehi

arXiv:2505.18417·cs.RO·February 2, 2026

Reinforcement Learning for Ballbot Navigation in Uneven Terrain

Achkan Salehi

PDF

Open Access 1 Repo

TL;DR

This paper introduces an open-source MuJoCo simulation for ballbot navigation on uneven terrain, demonstrating that classical RL methods can effectively learn navigation policies with reasonable data efficiency, surpassing traditional control approaches.

Contribution

It provides the first open-source RL-compatible simulation environment for ballbot navigation and shows RL's effectiveness in this domain with minimal data.

Findings

01

RL policies successfully navigate uneven terrain

02

Effective training achieved within four to five hours

03

Open-source code available for further research

Abstract

Ballbot (i.e. Ball balancing robot) navigation usually relies on methods rooted in control theory (CT), and works that apply Reinforcement learning (RL) to the problem remain rare while generally being limited to specific subtasks (e.g. balance recovery). Unlike CT based methods, RL does not require (simplifying) assumptions about environment dynamics (e.g. the absence of slippage between the ball and the floor). In addition to this increased accuracy in modeling, RL agents can easily be conditioned on additional observations such as depth-maps without the need for explicit formulations from first principles, leading to increased adaptivity. Despite those advantages, there has been little to no investigation into the capabilities, data-efficiency and limitations of RL based methods for ballbot control and navigation. Furthermore, there is a notable absence of an open-source, RL-friendly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

salehiac/openballbot-rl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Modular Robots and Swarm Intelligence · Reinforcement Learning in Robotics