Less is More: Parameter-Efficient Selection of Intermediate Tasks for   Transfer Learning

David Schulte; Felix Hamborg; Alan Akbik

arXiv:2410.15148·cs.CL·October 22, 2024

Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning

David Schulte, Felix Hamborg, Alan Akbik

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces Embedding Space Maps (ESMs), a lightweight neural network approach that efficiently predicts the transferability of NLP tasks, enabling large-scale task selection with reduced computational resources.

Contribution

The authors propose ESMs to approximate fine-tuning effects, allowing scalable and resource-efficient task transferability ranking in NLP.

Findings

01

ESMs reduce execution time by a factor of 10.

02

ESMs decrease disk space usage by a factor of 278.

03

High transferability prediction accuracy with an average regret@5 score of 2.95.

Abstract

Intermediate task transfer learning can greatly improve model performance. If, for example, one has little training data for emotion detection, first fine-tuning a language model on a sentiment classification dataset may improve performance strongly. But which task to choose for transfer learning? Prior methods producing useful task rankings are infeasible for large source pools, as they require forward passes through all source language models. We overcome this by introducing Embedding Space Maps (ESMs), light-weight neural networks that approximate the effect of fine-tuning a language model. We conduct the largest study on NLP task transferability and task selection with 12k source-target pairs. We find that applying ESMs on a prior method reduces execution time and disk space usage by factors of 10 and 278, respectively, while retaining high selection performance (avg. regret@5 score…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

davidschulte/hf-dataset-selector
pytorchOfficial

Models

🤗
davidschulte/hyperparamter_test
model· 2 dl
2 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM