Multi-Task Interactive Robot Fleet Learning with Visual World Models
Huihan Liu, Yu Zhang, Vaarij Betala, Evan Zhang, James Liu, Crystal, Ding, Yuke Zhu

TL;DR
Sirius-Fleet is a multi-task robot fleet learning framework that uses visual world models and human-in-the-loop corrections to improve robustness, reduce human workload, and enhance multi-task policy performance in diverse environments.
Contribution
The paper introduces Sirius-Fleet, a novel multi-task fleet learning approach that integrates visual world models and adaptive anomaly predictors to improve generalization and reduce human intervention.
Findings
Enhanced multi-task policy performance in benchmarks
Reduced human intervention over time
Effective anomaly prediction and correction mechanisms
Abstract
Recent advancements in large-scale multi-task robot learning offer the potential for deploying robot fleets in household and industrial settings, enabling them to perform diverse tasks across various environments. However, AI-enabled robots often face challenges with generalization and robustness when exposed to real-world variability and uncertainty. We introduce Sirius-Fleet, a multi-task interactive robot fleet learning framework to address these challenges. Sirius-Fleet monitors robot performance during deployment and involves humans to correct the robot's actions when necessary. We employ a visual world model to predict the outcomes of future actions and build anomaly predictors to predict whether they will likely result in anomalies. As the robot autonomy improves, the anomaly predictors automatically adapt their prediction criteria, leading to fewer requests for human…
Peer Reviews
Decision·CoRL 2024
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Metaheuristic Optimization Algorithms Research · Robotic Path Planning Algorithms
