CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics
Jiawei Gao, Ziqin Wang, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao,, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang

TL;DR
CooHOI introduces a two-phase learning framework enabling multi-humanoid robots to collaboratively manipulate objects by learning individual skills and then transferring policies for coordinated interaction, without relying on multi-humanoid motion capture data.
Contribution
The paper presents a novel two-phase learning paradigm for multi-humanoid collaboration that leverages imitation learning and multi-agent reinforcement learning, avoiding the need for multi-humanoid motion capture data.
Findings
Effective multi-humanoid collaboration in object transportation
Implicit communication achieved through shared object dynamics
Framework adaptable to various participants and object types
Abstract
Enabling humanoid robots to clean rooms has long been a pursued dream within humanoid research communities. However, many tasks require multi-humanoid collaboration, such as carrying large and heavy furniture together. Given the scarcity of motion capture data on multi-humanoid collaboration and the efficiency challenges associated with multi-agent learning, these tasks cannot be straightforwardly addressed using training paradigms designed for single-agent scenarios. In this paper, we introduce Cooperative Human-Object Interaction (CooHOI), a framework designed to tackle the challenge of multi-humanoid object transportation problem through a two-phase learning paradigm: individual skill learning and subsequent policy transfer. First, a single humanoid character learns to interact with objects through imitation learning from human motion priors. Then, the humanoid learns to collaborate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Multimodal Machine Learning Applications
