Embedding Hallucination for Few-Shot Language Fine-tuning

Yiren Jian; Chongyang Gao; Soroush Vosoughi

arXiv:2205.01307·cs.CL·May 4, 2022

Embedding Hallucination for Few-Shot Language Fine-tuning

Yiren Jian, Chongyang Gao, Soroush Vosoughi

PDF

Open Access 1 Repo

TL;DR

EmbedHalluc is a novel data augmentation technique for few-shot language fine-tuning that generates auxiliary embeddings to improve model generalization and reduce over-fitting, outperforming existing methods.

Contribution

We introduce EmbedHalluc, a method that creates synthetic embedding-label pairs via adversarial training to enhance few-shot language model fine-tuning.

Findings

01

EmbedHalluc improves performance across various language tasks.

02

It outperforms data augmentation, pseudo-labeling, and regularization methods.

03

The approach effectively mitigates over-fitting in few-shot learning.

Abstract

Few-shot language learners adapt knowledge from a pre-trained model to recognize novel classes from a few-labeled sentences. In such settings, fine-tuning a pre-trained language model can cause severe over-fitting. In this paper, we propose an Embedding Hallucination (EmbedHalluc) method, which generates auxiliary embedding-label pairs to expand the fine-tuning dataset. The hallucinator is trained by playing an adversarial game with the discriminator, such that the hallucinated embedding is indiscriminative to the real ones in the fine-tuning dataset. By training with the extended dataset, the language learner effectively learns from the diverse hallucinated embeddings to overcome the over-fitting issue. Experiments demonstrate that our proposed method is effective in a wide range of language tasks, outperforming current fine-tuning methods. Further, we show that EmbedHalluc outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yiren-jian/embedhalluc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques