Learning Latent Actions without Human Demonstrations
Shaunak A. Mehta, Sagar Parekh, Dylan P. Losey

TL;DR
This paper introduces an unsupervised method for learning diverse robot behaviors from object state changes, enabling control without human demonstrations and outperforming demonstration-based methods in some scenarios.
Contribution
The authors propose a novel unsupervised approach to learn a latent space of object-oriented behaviors for robot control, eliminating the need for human demonstrations.
Findings
Unsupervised approach can outperform demonstration-based mappings with noisy data.
Participants completed tasks faster with the unsupervised robot, but experienced confusion due to unexpected behaviors.
Abstract
We can make it easier for disabled users to control assistive robots by mapping the user's low-dimensional joystick inputs to high-dimensional, complex actions. Prior works learn these mappings from human demonstrations: a non-disabled human either teleoperates or kinesthetically guides the robot arm through a variety of motions, and the robot learns to reproduce the demonstrated behaviors. But this framework is often impractical - disabled users will not always have access to external demonstrations! Here we instead learn diverse teleoperation mappings without either human demonstrations or pre-defined tasks. Under our unsupervised approach the robot first optimizes for object state entropy: i.e., the robot autonomously learns to push, pull, open, close, or otherwise change the state of nearby objects. We then embed these diverse, object-oriented behaviors into a latent space for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Social Robot Interaction and HRI · Multimodal Machine Learning Applications
