A Quantitative Comparison of Centralised and Distributed Reinforcement Learning-Based Control for Soft Robotic Arms
Linxin Hou, Qirui Wu, Zhihang Qin, Neil Banerjee, Yongxin Guo, Cecilia Laschi

TL;DR
This study compares centralized and distributed reinforcement learning methods for controlling soft robotic arms, revealing trade-offs in efficiency, robustness, and success rates depending on system complexity.
Contribution
It provides a systematic, quantitative analysis of centralized versus distributed RL architectures for soft robotic control across varying system sizes.
Findings
Distributed policies excel in robustness and success rate for larger systems.
Centralized policies are more time-efficient during training.
Distributed RL shows higher sample efficiency in complex systems.
Abstract
This paper presents a quantitative comparison between centralised and distributed multi-agent reinforcement learning (MARL) architectures for controlling a soft robotic arm modelled as a Cosserat rod in simulation. Using PyElastica and the OpenAI Gym interface, we train both a global Proximal Policy Optimisation (PPO) controller and a Multi-Agent PPO (MAPPO) under identical budgets. Both approaches are based on the arm having number of controlled sections. The study systematically varies and evaluates the performance of the arm to reach a fixed target in three scenarios: default baseline condition, recovery from external disturbance, and adaptation to actuator failure. Quantitative metrics used for the evaluation are mean action magnitude, mean final distance, mean episode length, and success rate. The results show that there are no significant benefits of the distributed policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoft Robotics and Applications · Reinforcement Learning in Robotics · Robot Manipulation and Learning
