Control of Microrobots with Reinforcement Learning under On-Device Compute Constraints

Yichen Liu; Kesava Viswanadha; Zhongyu Li; Nelson Lojo; Kristofer S. J. Pister

arXiv:2512.24740·cs.RO·January 1, 2026

Control of Microrobots with Reinforcement Learning under On-Device Compute Constraints

Yichen Liu, Kesava Viswanadha, Zhongyu Li, Nelson Lojo, Kristofer S. J. Pister

PDF

Open Access

TL;DR

This paper demonstrates reinforcement learning-based control of a microrobot on ultra-low-power hardware, optimizing gait and inference methods to operate within strict compute and power constraints, and validates on real terrain.

Contribution

It introduces a resource-aware RL control framework for microrobots, integrating domain randomization, quantization, and gait selection to enable on-device autonomous locomotion.

Findings

01

Reinforcement learning can effectively control microrobots within hardware constraints.

02

Quantization improves inference speed on low-power microcontrollers.

03

Gait mode selection based on power budget optimizes robot performance.

Abstract

An important function of autonomous microrobots is the ability to perform robust movement over terrain. This paper explores an edge ML approach to microrobot locomotion, allowing for on-device, lower latency control under compute, memory, and power constraints. This paper explores the locomotion of a sub-centimeter quadrupedal microrobot via reinforcement learning (RL) and deploys the resulting controller on an ultra-small system-on-chip (SoC), SC $μ$ M-3C, featuring an ARM Cortex-M0 microcontroller running at 5 MHz. We train a compact FP32 multilayer perceptron (MLP) policy with two hidden layers ( $[128, 64]$ ) in a massively parallel GPU simulation and enhance robustness by utilizing domain randomization over simulation parameters. We then study integer (Int8) quantization (per-tensor and per-feature) to allow for higher inference update rates on our resource-limited hardware, and we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Micro and Nano Robotics · Reinforcement Learning in Robotics