TL;DR
This paper introduces MoVEInt, a novel method combining Variational Autoencoders and Mixture Density Networks to learn shared, multimodal latent spaces for human-robot interactions, improving robot motion generation from demonstrations.
Contribution
It proposes a new approach that integrates MDN priors into VAEs for more accurate, multimodal HRI modeling, addressing mode collapse and outperforming traditional HMM/GMM methods.
Findings
More accurate robot motions compared to previous methods
Effective in diverse HRI scenarios like handshakes and handovers
Successful real-world human-robot interaction demonstrations
Abstract
Shared dynamics models are important for capturing the complexity and variability inherent in Human-Robot Interaction (HRI). Therefore, learning such shared dynamics models can enhance coordination and adaptability to enable successful reactive interactions with a human partner. In this work, we propose a novel approach for learning a shared latent space representation for HRIs from demonstrations in a Mixture of Experts fashion for reactively generating robot actions from human observations. We train a Variational Autoencoder (VAE) to learn robot motions regularized using an informative latent space prior that captures the multimodality of the human observations via a Mixture Density Network (MDN). We show how our formulation derives from a Gaussian Mixture Regression formulation that is typically used approaches for learning HRI from demonstrations such as using an HMM/GMM for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
