DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs

Divya Jyoti Bajpai; Manjesh Kumar Hanawal

arXiv:2410.04424·cs.CL·October 8, 2024

DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs

Divya Jyoti Bajpai, Manjesh Kumar Hanawal

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces DADEE, an unsupervised domain adaptation framework for early exit pre-trained language models, improving efficiency and robustness across domains through multi-level adversarial knowledge distillation.

Contribution

DADEE is the first to combine unsupervised domain adaptation with early exit strategies in PLMs using multi-level adversarial training and knowledge distillation.

Findings

01

DADEE outperforms existing early exit methods in domain shift scenarios.

02

DADEE reduces inference latency while maintaining high accuracy.

03

DADEE enhances domain invariance across all model layers.

Abstract

Pre-trained Language Models (PLMs) exhibit good accuracy and generalization ability across various tasks using self-supervision, but their large size results in high inference latency. Early Exit (EE) strategies handle the issue by allowing the samples to exit from classifiers attached to the intermediary layers, but they do not generalize well, as exit classifiers can be sensitive to domain changes. To address this, we propose Unsupervised Domain Adaptation in EE framework (DADEE) that employs multi-level adaptation using knowledge distillation. DADEE utilizes GAN-based adversarial adaptation at each layer to achieve domain-invariant representations, reducing the domain gap between the source and target domain across all layers. The attached exits not only speed up inference but also enhance domain adaptation by reducing catastrophic forgetting and mode collapse, making it more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

div290/dadee
pytorchOfficial

Videos

DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs· underline

Taxonomy

TopicsSoftware System Performance and Reliability · Topic Modeling · Online Learning and Analytics

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings