Learning Visual Tracking and Reaching with Deep Reinforcement Learning   on a UR10e Robotic Arm

Colin Bellinger; Laurence Lamarche-Cliche

arXiv:2308.14652·cs.AI·August 29, 2023

Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic Arm

Colin Bellinger, Laurence Lamarche-Cliche

PDF

Open Access 1 Repo

TL;DR

This paper explores applying deep reinforcement learning to enable a UR10e robotic arm to learn visual tracking and reaching tasks, demonstrating that proximal policy optimization outperforms deep Q-learning in stability and data efficiency.

Contribution

The study develops specialized reinforcement learning environments for the UR10e robot and compares deep Q-learning with proximal policy optimization, highlighting the latter's advantages.

Findings

01

Proximal policy optimization learns more stable policies.

02

PPO requires less training data than DQN.

03

Initial results show promise for RL in industrial robotics.

Abstract

As technology progresses, industrial and scientific robots are increasingly being used in diverse settings. In many cases, however, programming the robot to perform such tasks is technically complex and costly. To maximize the utility of robots in industrial and scientific settings, they require the ability to quickly shift from one task to another. Reinforcement learning algorithms provide the potential to enable robots to learn optimal solutions to complete new tasks without directly reprogramming them. The current state-of-the-art in reinforcement learning, however, generally relies on fast simulations and parallelization to achieve optimal performance. These are often not possible in robotics applications. Thus, a significant amount of research is required to facilitate the efficient and safe, training and deployment of industrial and scientific reinforcement learning robots. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cbellinger27/bendrl_reacher_tracker
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsQ-Learning