Cross-modality Force and Language Embeddings for Natural Human-Robot Communication
Ravi Tejwani, Karl Velazquez, John Payne, Paolo Bonato, Harry Asada

TL;DR
This paper introduces a novel framework for embedding verbal and force profile data into a shared space to enhance natural human-robot communication through coordinated verbal and haptic cues.
Contribution
It presents a new method for cross-modality embedding of language and force profiles, enabling effective integration and substitution in human-robot interaction.
Findings
Language and force profiles can be embedded in a unified latent space.
Proximity in the latent space indicates effective coordination.
The framework supports supplementing, integrating, and substituting communication modalities.
Abstract
A method for cross-modality embedding of force profile and words is presented for synergistic coordination of verbal and haptic communication. When two people carry a large, heavy object together, they coordinate through verbal communication about the intended movements and physical forces applied to the object. This natural integration of verbal and physical cues enables effective coordination. Similarly, human-robot interaction could achieve this level of coordination by integrating verbal and haptic communication modalities. This paper presents a framework for embedding words and force profiles in a unified manner, so that the two communication modalities can be integrated and coordinated in a way that is effective and synergistic. Here, it will be shown that, although language and physical force profiles are deemed completely different, the two can be embedded in a unified latent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Robotics and Automated Systems · Speech and dialogue systems
