To Share or not to Share: Predicting Sets of Sources for Model Transfer   Learning

Lukas Lange; Jannik Str\"otgen; Heike Adel; Dietrich Klakow

arXiv:2104.08078·cs.CL·November 1, 2021

To Share or not to Share: Predicting Sets of Sources for Model Transfer Learning

Lukas Lange, Jannik Str\"otgen, Heike Adel, Dietrich Klakow

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method for predicting effective transfer sources in low-resource settings, improving sequence labeling performance by up to 24 F1 points through model similarity and SVM-based predictions.

Contribution

It presents a new approach combining model similarity and SVMs to automatically select sources for transfer learning, addressing limitations of previous ranking methods.

Findings

01

Predicts promising sources with up to 24 F1 points performance gain.

02

Shows effectiveness across various domains and tasks.

03

Demonstrates that source selection improves transfer learning outcomes.

Abstract

In low-resource settings, model transfer can help to overcome a lack of labeled data for many tasks and domains. However, predicting useful transfer sources is a challenging problem, as even the most similar sources might lead to unexpected negative transfer results. Thus, ranking methods based on task and text similarity -- as suggested in prior work -- may not be sufficient to identify promising sources. To tackle this problem, we propose a new approach to automatically determine which and how many sources should be exploited. For this, we study the effects of model transfer on sequence labeling across various domains and tasks and show that our methods based on model similarity and support vector machines are able to predict promising sources, resulting in performance increases of up to 24 F1 points.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

boschresearch/predicting_sets_of_sources
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis