Prior Omission of Dissimilar Source Domain(s) for Cost-Effective   Few-Shot Learning

Zezhong Wang; Hongru Wang; Kwan Wai Chung; Jia Zhu; Gabriel Pui Cheong; Fung; Kam-Fai Wong

arXiv:2109.05234·cs.CL·September 14, 2021·1 cites

Prior Omission of Dissimilar Source Domain(s) for Cost-Effective Few-Shot Learning

Zezhong Wang, Hongru Wang, Kwan Wai Chung, Jia Zhu, Gabriel Pui Cheong, Fung, Kam-Fai Wong

PDF

Open Access

TL;DR

This paper introduces a similarity-based data selection method and a Shared-Private Network for few-shot slot tagging, effectively reducing negative transfer from dissimilar source domains and improving adaptation performance.

Contribution

It proposes a novel similarity-based source data selection technique and a Shared-Private Network architecture for more effective few-shot domain adaptation in slot tagging.

Findings

01

Outperforms state-of-the-art methods with fewer source data

02

Redundant data from dissimilar sources can harm adaptation

03

Shared features improve label prediction in limited data scenarios

Abstract

Few-shot slot tagging is an emerging research topic in the field of Natural Language Understanding (NLU). With sufficient annotated data from source domains, the key challenge is how to train and adapt the model to another target domain which only has few labels. Conventional few-shot approaches use all the data from the source domains without considering inter-domain relations and implicitly assume each sample in the domain contributes equally. However, our experiments show that the data distribution bias among different domains will significantly affect the adaption performance. Moreover, transferring knowledge from dissimilar domains will even introduce some extra noises so that affect the performance of models. To tackle this problem, we propose an effective similarity-based method to select data from the source domains. In addition, we propose a Shared-Private Network (SP-Net) for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Multimodal Machine Learning Applications