ExpressMM: Expressive Mobile Manipulation Behaviors in Human-Robot Interactions

Souren Pashangpour; Haitong Wang; Matthew Lisondra; Goldie Nejat

arXiv:2604.05320·cs.RO·April 24, 2026

ExpressMM: Expressive Mobile Manipulation Behaviors in Human-Robot Interactions

Souren Pashangpour, Haitong Wang, Matthew Lisondra, Goldie Nejat

PDF

TL;DR

This paper introduces ExpressMM, a framework enabling mobile robots to perform expressive, interpretable, and interruptible behaviors during human-robot collaboration, enhancing communication and safety.

Contribution

Develops a novel high-level language-guided planning framework with low-level perception and action policies for expressive, interruptible behaviors in human-robot interaction.

Findings

01

ExpressMM improves clarity of robot intentions for observers.

02

Participants found the robot's behaviors useful, predictable, and safe.

03

ExpressMM supports dynamic interaction updates during tasks.

Abstract

Mobile manipulators are increasingly deployed in human-centered environments to perform tasks. While completing such tasks, they should also be able to communicate their intent to the people around them using expressive robot behaviors. Prior work on expressive robot behaviors has used preprogrammed or learning-from-demonstration-based expressive motions and large language model generated high-level interactions. The majority of these existing approaches have not considered human-robot interactions (HRI) where users may interrupt, modify, or redirect a robot's actions during task execution. In this paper, we develop the novel ExpressMM framework that integrates a high-level language-guided planner based on a vision-language model for perception and conversational reasoning with a low-level vision-language-action policy to generate expressive robot behaviors during collaborative HRI…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.