Domain Adaptation of Visual Policies with a Single Demonstration

Weiyao Wang; Gregory D. Hager

arXiv:2407.16820·cs.RO·July 25, 2024

Domain Adaptation of Visual Policies with a Single Demonstration

Weiyao Wang, Gregory D. Hager

PDF

TL;DR

PromptAdapt is a novel framework that uses a single demonstration and Transformer models to enable visuomotor policies to adapt effectively to new environments with domain shifts, outperforming traditional domain randomization methods.

Contribution

We introduce PromptAdapt, a demonstration-conditioned visual policy framework utilizing Transformers for in-context adaptation to diverse target domains.

Findings

01

Outperforms baseline methods under various domain shifts

02

Effective in both simulation and real-world environments

03

Handles lighting, color, texture, and camera pose variations

Abstract

Deploying machine learning algorithms for robot tasks in real-world applications presents a core challenge: overcoming the domain gap between the training and the deployment environment. This is particularly difficult for visuomotor policies that utilize high-dimensional images as input, particularly when those images are generated via simulation. A common method to tackle this issue is through domain randomization, which aims to broaden the span of the training distribution to cover the test-time distribution. However, this approach is only effective when the domain randomization encompasses the actual shifts in the test-time distribution. We take a different approach, where we make use of a single demonstration (a prompt) to learn policy that adapts to the testing target environment. Our proposed framework, PromptAdapt, leverages the Transformer architecture's capacity to model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.