Learning Human-Robot Handovers Through $\pi$-STAM: Policy Improvement With Spatio-Temporal Affordance Maps
Francesco Riccio, Roberto Capobianco, Daniele Nardi

TL;DR
This paper introduces $1c-b0STAM, an iterative algorithm that learns spatial affordances to improve human-robot handovers, enabling efficient policy learning with fewer episodes and the incorporation of prior knowledge.
Contribution
The paper presents a novel algorithm, 1c-b0STAM, for learning spatial affordances in human-robot handovers, addressing high-dimensionality and interpretability issues in robot policy learning.
Findings
Efficient policy learning with few training episodes.
Reduced computational load and learning time.
Successful validation on simulation and real NAO robot.
Abstract
Human-robot handovers are characterized by high uncertainty and poor structure of the problem that make them difficult tasks. While machine learning methods have shown promising results, their application to problems with large state dimensionality, such as in the case of humanoid robots, is still limited. Additionally, by using these methods and during the interaction with the human operator, no guarantees can be obtained on the correct interpretation of spatial constraints (e.g., from social rules). In this paper, we present Policy Improvement with Spatio-Temporal Affordance Maps -- -STAM, a novel iterative algorithm to learn spatial affordances and generate robot behaviors. Our goal consists in generating a policy that adapts to the unknown action semantics by using affordances. In this way, while learning to perform a human-robot handover task, we can (1) efficiently generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications
