# On Training Flexible Robots using Deep Reinforcement Learning

**Authors:** Zach Dwiel, Madhavun Candadai, Mariano Phielipp

arXiv: 1907.00269 · 2019-07-16

## TL;DR

This paper investigates the effectiveness of deep reinforcement learning in training flexible robots, demonstrating its ability to learn robust policies for complex tasks despite sensor sensitivities.

## Contribution

It systematically evaluates DRL methods for flexible robot control, highlighting their potential and limitations in real-world adaptable robotics.

## Key findings

- DRL can learn efficient policies for flexible robots.
- Deep Deterministic Policy Gradients are sensitive to sensor choices.
- Adding more sensors does not always simplify learning.

## Abstract

The use of robotics in controlled environments has flourished over the last several decades and training robots to perform tasks using control strategies developed from dynamical models of their hardware have proven very effective. However, in many real-world settings, the uncertainties of the environment, the safety requirements and generalized capabilities that are expected of robots make rigid industrial robots unsuitable. This created great research interest into developing control strategies for flexible robot hardware for which building dynamical models are challenging. In this paper, inspired by the success of deep reinforcement learning (DRL) in other areas, we systematically study the efficacy of policy search methods using DRL in training flexible robots. Our results indicate that DRL is successfully able to learn efficient and robust policies for complex tasks at various degrees of flexibility. We also note that DRL using Deep Deterministic Policy Gradients can be sensitive to the choice of sensors and adding more informative sensors does not necessarily make the task easier to learn.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.00269/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1907.00269/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/1907.00269/full.md

---
Source: https://tomesphere.com/paper/1907.00269