Can You Label Less by Using Out-of-Domain Data? Active & Transfer   Learning with Few-shot Instructions

Rafal Kocielnik; Sara Kangaslahti; Shrimai Prabhumoye; Meena Hari; R.; Michael Alvarez; Anima Anandkumar

arXiv:2211.11798·cs.CL·November 23, 2022

Can You Label Less by Using Out-of-Domain Data? Active & Transfer Learning with Few-shot Instructions

Rafal Kocielnik, Sara Kangaslahti, Shrimai Prabhumoye, Meena Hari, R., Michael Alvarez, Anima Anandkumar

PDF

Open Access

TL;DR

This paper introduces ATF, a novel method that leverages pre-trained language models for active transfer learning without fine-tuning, significantly reducing annotation effort in social media toxicity and bias detection.

Contribution

The paper proposes ATF, a fine-tuning-free active transfer learning approach that effectively transfers knowledge from labeled datasets to new domains using pre-trained language models.

Findings

01

ATF achieves a 10.5% mean AUC gain over no transfer.

02

Active learning with few samples enhances transfer benefits.

03

Transfer success depends on the initial performance of PLMs on target tasks.

Abstract

Labeling social-media data for custom dimensions of toxicity and social bias is challenging and labor-intensive. Existing transfer and active learning approaches meant to reduce annotation effort require fine-tuning, which suffers from over-fitting to noise and can cause domain shift with small sample sizes. In this work, we propose a novel Active Transfer Few-shot Instructions (ATF) approach which requires no fine-tuning. ATF leverages the internal linguistic knowledge of pre-trained language models (PLMs) to facilitate the transfer of information from existing pre-labeled datasets (source-domain task) with minimum labeling effort on unlabeled target data (target-domain task). Our strategy can yield positive transfer achieving a mean AUC gain of 10.5% compared to no transfer with a large 22b parameter PLM. We further show that annotation of just a few target-domain samples via active…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Speech Recognition and Synthesis