Navigation of micro-robot swarms for targeted delivery using   reinforcement learning

Akshatha Jagadish; Manoj Varma

arXiv:2306.17598·cs.RO·July 3, 2023

Navigation of micro-robot swarms for targeted delivery using reinforcement learning

Akshatha Jagadish, Manoj Varma

PDF

Open Access

TL;DR

This paper demonstrates how reinforcement learning algorithms, specifically PPO and RPO, can effectively control micro-robot swarms for targeted delivery, even with limited information and under various challenging conditions.

Contribution

It introduces the application of RL algorithms to micro-robot swarm navigation, incorporating hydrodynamic effects and curriculum learning for improved performance.

Findings

01

RL algorithms successfully navigate swarms of 4, 9, and 16 microswimmers.

02

Curriculum learning enhances navigation performance.

03

Robustness tests show effective target localization despite randomness.

Abstract

Micro robotics is quickly emerging to be a promising technological solution to many medical treatments with focus on targeted drug delivery. They are effective when working in swarms whose individual control is mostly infeasible owing to their minute size. Controlling a number of robots with a single controller is thus important and artificial intelligence can help us perform this task successfully. In this work, we use the Reinforcement Learning (RL) algorithms Proximal Policy Optimization (PPO) and Robust Policy Optimization (RPO) to navigate a swarm of 4, 9 and 16 microswimmers under hydrodynamic effects, controlled by their orientation, towards a circular absorbing target. We look at both PPO and RPO performances with limited state information scenarios and also test their robustness for random target location and size. We use curriculum learning to improve upon the performance and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMicro and Nano Robotics · Molecular Communication and Nanonetworks · Neuroscience and Neural Engineering

MethodsEntropy Regularization · Focus · Proximal Policy Optimization