Cross-modality Force and Language Embeddings for Natural Human-Robot   Communication

Ravi Tejwani; Karl Velazquez; John Payne; Paolo Bonato; Harry Asada

arXiv:2502.02772·cs.RO·April 29, 2025

Cross-modality Force and Language Embeddings for Natural Human-Robot Communication

Ravi Tejwani, Karl Velazquez, John Payne, Paolo Bonato, Harry Asada

PDF

Open Access

TL;DR

This paper introduces a novel framework for embedding verbal and force profile data into a shared space to enhance natural human-robot communication through coordinated verbal and haptic cues.

Contribution

It presents a new method for cross-modality embedding of language and force profiles, enabling effective integration and substitution in human-robot interaction.

Findings

01

Language and force profiles can be embedded in a unified latent space.

02

Proximity in the latent space indicates effective coordination.

03

The framework supports supplementing, integrating, and substituting communication modalities.

Abstract

A method for cross-modality embedding of force profile and words is presented for synergistic coordination of verbal and haptic communication. When two people carry a large, heavy object together, they coordinate through verbal communication about the intended movements and physical forces applied to the object. This natural integration of verbal and physical cues enables effective coordination. Similarly, human-robot interaction could achieve this level of coordination by integrating verbal and haptic communication modalities. This paper presents a framework for embedding words and force profiles in a unified manner, so that the two communication modalities can be integrated and coordinated in a way that is effective and synergistic. Here, it will be shown that, although language and physical force profiles are deemed completely different, the two can be embedded in a unified latent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHand Gesture Recognition Systems · Robotics and Automated Systems · Speech and dialogue systems