Policy Adaptation via Language Optimization: Decomposing Tasks for   Few-Shot Imitation

Vivek Myers; Bill Chunyuan Zheng; Oier Mees; Sergey Levine; Kuan Fang

arXiv:2408.16228·cs.RO·January 9, 2025

Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation

Vivek Myers, Bill Chunyuan Zheng, Oier Mees, Sergey Levine, Kuan Fang

PDF

Open Access

TL;DR

This paper introduces PALO, a method that leverages vision-language models to decompose tasks and enable rapid few-shot adaptation of robot policies to new, complex tasks without extensive retraining.

Contribution

The paper presents a novel approach combining task decomposition with language optimization for effective few-shot adaptation in robot manipulation.

Findings

01

PALO outperforms existing policies on complex real-world tasks.

02

It enables rapid adaptation without large fine-tuning datasets.

03

Successfully handles long-horizon, multi-step tasks in real environments.

Abstract

Learned language-conditioned robot policies often struggle to effectively adapt to new real-world tasks even when pre-trained across a diverse set of instructions. We propose a novel approach for few-shot adaptation to unseen tasks that exploits the semantic understanding of task decomposition provided by vision-language models (VLMs). Our method, Policy Adaptation via Language Optimization (PALO), combines a handful of demonstrations of a task with proposed language decompositions sampled from a VLM to quickly enable rapid nonparametric adaptation, avoiding the need for a larger fine-tuning dataset. We evaluate PALO on extensive real-world experiments consisting of challenging unseen, long-horizon robot manipulation tasks. We find that PALO is able of consistently complete long-horizon, multi-tier tasks in the real world, outperforming state of the art pre-trained generalist policies,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications

MethodsSparse Evolutionary Training