Learning to Collaborate from Simulation for Robot-Assisted Dressing
Alexander Clegg, Zackory Erickson, Patrick Grady, Greg Turk, Charles, C. Kemp, and C. Karen Liu

TL;DR
This paper presents a simulation-based deep reinforcement learning approach for robot-assisted dressing, enabling robots to adapt to human impairments and transfer policies to real robots for improved collaboration.
Contribution
It introduces a novel DRL framework that models human impairments and multi-modal sensing, improving robot-human dressing collaboration in simulation and real-world transfer.
Findings
Training for specific impairments enhances performance
Scaling controller speed maintains performance with reduced speed
Curriculum learning reduces applied forces
Abstract
We investigated the application of haptic feedback control and deep reinforcement learning (DRL) to robot-assisted dressing. Our method uses DRL to simultaneously train human and robot control policies as separate neural networks using physics simulations. In addition, we modeled variations in human impairments relevant to dressing, including unilateral muscle weakness, involuntary arm motion, and limited range of motion. Our approach resulted in control policies that successfully collaborate in a variety of simulated dressing tasks involving a hospital gown and a T-shirt. In addition, our approach resulted in policies trained in simulation that enabled a real PR2 robot to dress the arm of a humanoid robot with a hospital gown. We found that training policies for specific impairments dramatically improved performance; that controller execution speed could be scaled after training to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
