Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot   Policy Imitation

Massimiliano Patacchiola; Mingfei Sun; Katja Hofmann; Richard E.; Turner

arXiv:2306.13554·cs.LG·June 26, 2023

Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation

Massimiliano Patacchiola, Mingfei Sun, Katja Hofmann, Richard E., Turner

PDF

Open Access 4 Repos

TL;DR

This paper compares fine-tuning and meta-learning approaches for few-shot imitation learning in control tasks, showing that fine-tuning is a practical and competitive alternative to meta-learning, especially in out-of-domain scenarios.

Contribution

It introduces a simple two-stage fine-tuning baseline for few-shot imitation in control, demonstrating its effectiveness and releasing a new dataset for further research.

Findings

01

Fine-tuning rivals meta-learning in control imitation tasks.

02

The proposed baseline is practical and easy to implement.

03

The dataset iMuJoCo supports future research in this area.

Abstract

In this paper we explore few-shot imitation learning for control problems, which involves learning to imitate a target policy by accessing a limited set of offline rollouts. This setting has been relatively under-explored despite its relevance to robotics and control applications. State-of-the-art methods developed to tackle few-shot imitation rely on meta-learning, which is expensive to train as it requires access to a distribution over tasks (rollouts from many target policies and variations of the base environment). Given this limitation we investigate an alternative approach, fine-tuning, a family of methods that pretrain on a single dataset and then fine-tune on unseen domain-specific data. Recent work has shown that fine-tuners outperform meta-learners in few-shot image classification tasks, especially when the data is out-of-domain. Here we evaluate to what extent this is true…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsBalanced Selection