Coordinated Humanoid Manipulation with Choice Policies
Haozhi Qi, Yen-Jen Wang, Toru Lin, Brent Yi, Yi Ma, Koushil Sreenath, Jitendra Malik

TL;DR
This paper introduces a modular teleoperation system combined with a novel Choice Policy imitation learning approach, enabling humanoid robots to perform coordinated manipulation tasks more effectively in unstructured environments.
Contribution
The paper presents the Choice Policy, a new imitation learning method that generates and scores multiple candidate actions, improving multimodal behavior modeling in humanoid manipulation.
Findings
Choice Policy outperforms diffusion policies and behavior cloning.
Modular teleoperation facilitates efficient demonstration collection.
Hand-eye coordination is crucial for long-horizon tasks.
Abstract
Humanoid robots hold great promise for operating in human-centric environments, yet achieving robust whole-body coordination across the head, hands, and legs remains a major challenge. We present a system that combines a modular teleoperation interface with a scalable learning framework to address this problem. Our teleoperation design decomposes humanoid control into intuitive submodules, which include hand-eye coordination, grasp primitives, arm end-effector tracking, and locomotion. This modularity allows us to collect high-quality demonstrations efficiently. Building on this, we introduce Choice Policy, an imitation learning approach that generates multiple candidate actions and learns to score them. This architecture enables both fast inference and effective modeling of multimodal behaviors. We validate our approach on two real-world tasks: dishwasher loading and whole-body…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Social Robot Interaction and HRI · Robotic Locomotion and Control
