Embedding Hallucination for Few-Shot Language Fine-tuning
Yiren Jian, Chongyang Gao, Soroush Vosoughi

TL;DR
EmbedHalluc is a novel data augmentation technique for few-shot language fine-tuning that generates auxiliary embeddings to improve model generalization and reduce over-fitting, outperforming existing methods.
Contribution
We introduce EmbedHalluc, a method that creates synthetic embedding-label pairs via adversarial training to enhance few-shot language model fine-tuning.
Findings
EmbedHalluc improves performance across various language tasks.
It outperforms data augmentation, pseudo-labeling, and regularization methods.
The approach effectively mitigates over-fitting in few-shot learning.
Abstract
Few-shot language learners adapt knowledge from a pre-trained model to recognize novel classes from a few-labeled sentences. In such settings, fine-tuning a pre-trained language model can cause severe over-fitting. In this paper, we propose an Embedding Hallucination (EmbedHalluc) method, which generates auxiliary embedding-label pairs to expand the fine-tuning dataset. The hallucinator is trained by playing an adversarial game with the discriminator, such that the hallucinated embedding is indiscriminative to the real ones in the fine-tuning dataset. By training with the extended dataset, the language learner effectively learns from the diverse hallucinated embeddings to overcome the over-fitting issue. Experiments demonstrate that our proposed method is effective in a wide range of language tasks, outperforming current fine-tuning methods. Further, we show that EmbedHalluc outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
