Neural Supervised Domain Adaptation by Augmenting Pre-trained Models with Random Units
Sara Meftah, Nasredine Semmar, Youssef Tamaazousti, Hassane Essafi,, Fatiha Sadat

TL;DR
This paper introduces a method to improve neural domain adaptation in NLP by augmenting pre-trained models with randomly initialized units, addressing limitations of standard fine-tuning and enhancing performance across multiple tasks.
Contribution
It proposes augmenting pre-trained models with random units to better capture target domain patterns and reduce negative transfer effects in neural domain adaptation.
Findings
Significant performance improvements on four NLP tasks.
Enhanced adaptation from news to social media domains.
Reduction of negative transfer effects.
Abstract
Neural Transfer Learning (TL) is becoming ubiquitous in Natural Language Processing (NLP), thanks to its high performance on many tasks, especially in low-resourced scenarios. Notably, TL is widely used for neural domain adaptation to transfer valuable knowledge from high-resource to low-resource domains. In the standard fine-tuning scheme of TL, a model is initially pre-trained on a source domain and subsequently fine-tuned on a target domain and, therefore, source and target domains are trained using the same architecture. In this paper, we show through interpretation methods that such scheme, despite its efficiency, is suffering from a main limitation. Indeed, although capable of adapting to new domains, pre-trained neurons struggle with learning certain patterns that are specific to the target domain. Moreover, we shed light on the hidden negative transfer occurring despite the high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
