ReactMotion: Generating Reactive Listener Motions from Speaker Utterance

Cheng Luo; Bizhu Wu; Bing Li; Jianfeng Ren; Ruibin Bai; Rong Qu; Linlin Shen; Bernard Ghanem

arXiv:2603.15083·cs.CV·March 17, 2026

ReactMotion: Generating Reactive Listener Motions from Speaker Utterance

Cheng Luo, Bizhu Wu, Bing Li, Jianfeng Ren, Ruibin Bai, Rong Qu, Linlin Shen, Bernard Ghanem

PDF

Open Access

TL;DR

This paper introduces ReactMotion, a new task and dataset for generating natural listener motions in response to speaker utterances, addressing the challenge of modeling non-deterministic human reactions.

Contribution

It presents ReactMotionNet dataset capturing multiple listener responses and a unified generative framework ReactMotion trained with preference-based objectives.

Findings

01

ReactMotion outperforms retrieval baselines.

02

Generated motions are more natural and diverse.

03

Preference-oriented evaluation protocols improve assessment.

Abstract

In this paper, we introduce a new task, Reactive Listener Motion Generation from Speaker Utterance, which aims to generate naturalistic listener body motions that appropriately respond to a speaker's utterance. However, modeling such nonverbal listener behaviors remains underexplored and challenging due to the inherently non-deterministic nature of human reactions. To facilitate this task, we present ReactMotionNet, a large-scale dataset that pairs speaker utterances with multiple candidate listener motions annotated with varying degrees of appropriateness. This dataset design explicitly captures the one-to-many nature of listener behavior and provides supervision beyond a single ground-truth motion. Building on this dataset design, we develop preference-oriented evaluation protocols tailored to evaluate reactive appropriateness, where conventional motion metrics focusing on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSocial Robot Interaction and HRI · Emotion and Mood Recognition · Face recognition and analysis