Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments
Haritheja Etukuru, Norihito Naka, Zijin Hu, Seungjae Lee, Julian Mehu,, Aaron Edsinger, Chris Paxton, Soumith Chintala, Lerrel Pinto, Nur Muhammad, Mahi Shafiullah

TL;DR
This paper introduces Robot Utility Models (RUMs), a framework enabling zero-shot deployment of robot policies in new environments without fine-tuning, using multi-modal imitation learning and efficient data collection.
Contribution
The work presents a novel framework for zero-shot robot policy deployment, including new tools for data collection, integration into policies, and on-device deployment on a low-cost robot.
Findings
Achieves 90% success rate in unseen environments
Policies generalize across different robot and camera setups
Highlights importance of diverse, high-quality demonstrations
Abstract
Robot models, particularly those trained with large amounts of data, have recently shown a plethora of real-world manipulation and navigation capabilities. Several independent efforts have shown that given sufficient training data in an environment, robot policies can generalize to demonstrated variations in that environment. However, needing to finetune robot models to every new environment stands in stark contrast to models in language or vision that can be deployed zero-shot for open-world problems. In this work, we present Robot Utility Models (RUMs), a framework for training and deploying zero-shot robot policies that can directly generalize to new environments without any finetuning. To create RUMs efficiently, we develop new tools to quickly collect data for mobile manipulation tasks, integrate such data into a policy with multi-modal imitation learning, and deploy policies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpace Satellite Systems and Control · Space Exploration and Technology
Methodsbye · Context Aggregated Bi-lateral Network for Semantic Segmentation
