ReMoGen: Real-time Human Interaction-to-Reaction Generation via Modular Learning from Diverse Data

Yaoqin Ye; Yiteng Xu; Qin Sun; Xinge Zhu; Yujing Sun; Yuexin Ma

arXiv:2604.01082·cs.CV·April 2, 2026

ReMoGen: Real-time Human Interaction-to-Reaction Generation via Modular Learning from Diverse Data

Yaoqin Ye, Yiteng Xu, Qin Sun, Xinge Zhu, Yujing Sun, Yuexin Ma

PDF

TL;DR

ReMoGen is a modular framework that enables real-time, high-fidelity human reaction generation in interactive scenarios by leveraging large-scale motion data and adaptive modules for diverse, data-scarce environments.

Contribution

It introduces ReMoGen, a novel modular learning approach that combines a universal motion prior with meta-interaction modules for robust, real-time human reaction synthesis.

Findings

01

ReMoGen produces high-quality, coherent reactions in diverse interaction settings.

02

The framework generalizes effectively across different interaction scenarios.

03

ReMoGen achieves low-latency responses suitable for online interaction.

Abstract

Human behaviors in real-world environments are inherently interactive, with an individual's motion shaped by surrounding agents and the scene. Such capabilities are essential for applications in virtual avatars, interactive animation, and human-robot collaboration. We target real-time human interaction-to-reaction generation, which generates the ego's future motion from dynamic multi-source cues, including others' actions, scene geometry, and optional high-level semantic inputs. This task is fundamentally challenging due to (i) limited and fragmented interaction data distributed across heterogeneous single-person, human-human, and human-scene domains, and (ii) the need to produce low-latency yet high-fidelity motion responses during continuous online interaction. To address these challenges, we propose ReMoGen (Reaction Motion Generation), a modular learning framework for real-time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.