Robustifying Long-term Human-Robot Collaboration through a Multimodal and Hierarchical Framework
Peiqi Yu, Abulikemu Abuduweili, Ruixuan Liu, Changliu Liu

TL;DR
This paper introduces a multimodal hierarchical framework for long-term human-robot collaboration, improving robustness, efficiency, and user satisfaction through integrated perception, plan prediction, and online adaptation.
Contribution
It presents a novel multimodal and hierarchical framework that enhances understanding and assistance in long-term HRC, with real-world deployment and extensive user studies.
Findings
Reduced task completion time by 15.9%
Achieved 91.8% success rate in tasks
User satisfaction score of 84%
Abstract
Long-term Human-Robot Collaboration (HRC) is crucial for enabling flexible manufacturing systems and integrating companion robots into daily human environments over extended periods. This paper identifies several key challenges for such collaborations, such as accurate recognition of human plan, robustness to disturbances, operational efficiency, adaptability to diverse user behaviors, and sustained human satisfaction. To address these challenges, we model the long-term HRC task through a hierarchical task graph and presents a novel multimodal and hierarchical framework to enable robots to better assist humans to advance on the task graph. In particular, the proposed multimodal framework integrates visual observations with speech commands to facilitate intuitive and flexible human-robot interactions. Additionally, our hierarchical designs for both human pose detection and plan…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning
