Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Massimiliano Patacchiola, Mingfei Sun, Katja Hofmann, Richard E., Turner

TL;DR
This paper compares fine-tuning and meta-learning approaches for few-shot imitation learning in control tasks, showing that fine-tuning is a practical and competitive alternative to meta-learning, especially in out-of-domain scenarios.
Contribution
It introduces a simple two-stage fine-tuning baseline for few-shot imitation in control, demonstrating its effectiveness and releasing a new dataset for further research.
Findings
Fine-tuning rivals meta-learning in control imitation tasks.
The proposed baseline is practical and easy to implement.
The dataset iMuJoCo supports future research in this area.
Abstract
In this paper we explore few-shot imitation learning for control problems, which involves learning to imitate a target policy by accessing a limited set of offline rollouts. This setting has been relatively under-explored despite its relevance to robotics and control applications. State-of-the-art methods developed to tackle few-shot imitation rely on meta-learning, which is expensive to train as it requires access to a distribution over tasks (rollouts from many target policies and variations of the base environment). Given this limitation we investigate an alternative approach, fine-tuning, a family of methods that pretrain on a single dataset and then fine-tune on unseen domain-specific data. Recent work has shown that fine-tuners outperform meta-learners in few-shot image classification tasks, especially when the data is out-of-domain. Here we evaluate to what extent this is true…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsBalanced Selection
