Is Imitation All You Need? Generalized Decision-Making with Dual-Phase   Training

Yao Wei; Yanchao Sun; Ruijie Zheng; Sai Vemprala; Rogerio; Bonatti; Shuhang Chen; Ratnesh Madaan; Zhongjie Ba; Ashish Kapoor; and Shuang Ma

arXiv:2307.07909·cs.AI·October 10, 2023·1 cites

Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training

Yao Wei, Yanchao Sun, Ruijie Zheng, Sai Vemprala, Rogerio, Bonatti, Shuhang Chen, Ratnesh Madaan, Zhongjie Ba, Ashish Kapoor, and Shuang Ma

PDF

Open Access 1 Repo

TL;DR

DualMind introduces a dual-phase training approach for generalist decision-making agents, enabling zero-shot task generalization across diverse domains without task-specific fine-tuning.

Contribution

The paper proposes a novel dual-phase training strategy that improves generalization and reduces overfitting in decision-making agents across multiple domains.

Findings

01

Outperforms previous generalist agents by over 50% on Habitat and 70% on MetaWorld.

02

Successfully completes over 30 MetaWorld tasks at 90% success rate.

03

Demonstrates zero-shot generalization across diverse tasks and environments.

Abstract

We introduce DualMind, a generalist agent designed to tackle various decision-making tasks that addresses challenges posed by current methods, such as overfitting behaviors and dependence on task-specific fine-tuning. DualMind uses a novel "Dual-phase" training strategy that emulates how humans learn to act in the world. The model first learns fundamental common knowledge through a self-supervised objective tailored for control tasks and then learns how to make decisions based on different contexts through imitating behaviors conditioned on given prompts. DualMind can handle tasks across domains, scenes, and embodiments using just a single set of model weights and can execute zero-shot prompting without requiring task-specific fine-tuning. We evaluate DualMind on MetaWorld and Habitat through extensive experiments and demonstrate its superior generalizability compared to previous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yunyikristy/dualmind
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Machine Learning and Data Classification