Improving Assistive Robotics with Deep Reinforcement Learning
Yash Jakhotiya, Iman Haque

TL;DR
This paper evaluates reinforcement learning techniques for assistive robotics, replicates a baseline in simulation, and explores advanced neural network methods, finding mixed results and discussing potential reasons for limited success.
Contribution
It replicates a reinforcement learning baseline for assistive tasks and investigates the effectiveness of recurrent neural networks and Phasic Policy Gradient methods.
Findings
Baseline implementation matches or exceeds original performance.
Advanced methods did not outperform the baseline as expected.
Discussion on challenges and future directions for RL in assistive robotics.
Abstract
Assistive Robotics is a class of robotics concerned with aiding humans in daily care tasks that they may be inhibited from doing due to disabilities or age. While research has demonstrated that classical control methods can be used to design policies to complete these tasks, these methods can be difficult to generalize to a variety of instantiations of a task. Reinforcement learning can provide a solution to this issue, wherein robots are trained in simulation and their policies are transferred to real-world machines. In this work, we replicate a published baseline for training robots on three tasks in the Assistive Gym environment, and we explore the usage of a Recurrent Neural Network and Phasic Policy Gradient learning to augment the original work. Our baseline implementation meets or exceeds the baseline of the original work, however, we found that our explorations into the new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Prosthetics and Rehabilitation Robotics · Stroke Rehabilitation and Recovery
