Active Fine-Tuning of Multi-Task Policies

Marco Bagatella; Jonas Hübotter; Georg Martius; Andreas Krause

arXiv:2410.05026 · cs.LG · June 24, 2025

TL;DR

This paper introduces AMF, an active learning algorithm that adaptively selects demonstrations to efficiently fine-tune multi-task policies, improving performance with limited data in complex environments.

Contribution

The paper proposes AMF, a novel active learning approach for multi-task policy fine-tuning that maximizes information gain and provides theoretical performance guarantees.

Findings

1. AMF outperforms baseline methods in efficiency and accuracy.

2. AMF effectively adapts to complex, high-dimensional environments.

3. Theoretical guarantees support the effectiveness of AMF.

Abstract

Pre-trained generalist policies are rapidly gaining relevance in robot learning due to their promise of fast adaptation to novel, in-domain tasks. This adaptation often relies on collecting new demonstrations for a specific task of interest and applying imitation learning algorithms, such as behavioral cloning. However, as soon as several tasks need to be learned, we must decide which tasks should be demonstrated and how often. We study this multi-task problem and explore an interactive framework in which the agent adaptively selects the tasks to be demonstrated. We propose AMF (Active Multi-task Fine-tuning), an algorithm to maximize multi-task policy performance under a limited demonstration budget by collecting demonstrations yielding the largest information gain on the expert policy. We derive performance guarantees for AMF under regularity assumptions and demonstrate its empirical…
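
The selection principle described in the abstract (request the demonstration with the largest information gain, under a fixed budget) can be illustrated with a toy model. The sketch below is not the paper's implementation: it assumes a hypothetical Gaussian model in which each task has one scalar latent value, tasks are correlated through an assumed prior covariance matrix, and a demonstration yields a noisy observation of the chosen task's value. The function name `greedy_task_selection` and its parameters are illustrative only.

```python
import numpy as np


def greedy_task_selection(prior_cov, noise_var, budget):
    """Greedily pick which task to demonstrate next (toy Gaussian model).

    Assumption: one scalar latent per task with joint Gaussian prior
    `prior_cov`; a demonstration of task i is a noisy observation of that
    latent with variance `noise_var`. Returns demonstrations per task.
    """
    cov = np.array(prior_cov, dtype=float)
    counts = np.zeros(cov.shape[0], dtype=int)
    for _ in range(budget):
        # Information gain of one noisy observation of task i:
        # 0.5 * log(1 + posterior_variance_i / noise_variance).
        gains = 0.5 * np.log1p(np.diag(cov) / noise_var)
        i = int(np.argmax(gains))
        counts[i] += 1
        # Standard Gaussian conditioning (rank-one posterior update).
        col = cov[:, i].copy()
        denom = cov[i, i] + noise_var
        cov = cov - np.outer(col, col) / denom
    return counts


# Three uncorrelated tasks; the middle one starts out most uncertain.
counts = greedy_task_selection(np.diag([1.0, 4.0, 1.0]), noise_var=1.0, budget=4)
print(counts)
```

In this toy setting the budget is spent first on the most uncertain task and then spread across the others as their relative uncertainty changes, which is the adaptive behaviour the abstract attributes to active task selection.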

Code & Models

Repositories

marbaga/amf
pytorch

Videos

Active Fine-Tuning of Multi-Task Policies · SlidesLive

Taxonomy

Topics: Complex Systems and Decision Making