Task-guided Disentangled Tuning for Pretrained Language Models

Jiali Zeng; Yufan Jiang; Shuangzhi Wu; Yongjing Yin; Mu Li

arXiv:2203.11431·cs.CL·March 23, 2022

TL;DR

Task-guided Disentangled Tuning (TDT) improves the adaptation of pretrained language models to specific NLP tasks by disentangling task-relevant signals, leading to better performance especially in low-data scenarios.

Contribution

The paper introduces TDT, a tuning method that improves the generalization of PLM representations by disentangling task-relevant signals from entangled representations, using a learnable confidence model and a disentangled regularization.

Findings

01 TDT outperforms standard fine-tuning on GLUE and CLUE benchmarks.

02 TDT demonstrates robustness across different PLMs and tasks.

03 Disentangling task signals improves low-data regime performance.

Abstract

Pretrained language models (PLMs) trained on large-scale unlabeled corpora are typically fine-tuned on task-specific downstream datasets, an approach that has produced state-of-the-art results on various NLP tasks. However, the discrepancy in domain and scale between pretraining and downstream data makes fine-tuning fail to capture task-specific patterns efficiently, especially in the low-data regime. To address this issue, we propose Task-guided Disentangled Tuning (TDT) for PLMs, which enhances the generalization of representations by disentangling task-relevant signals from the entangled representations. For a given task, we introduce a learnable confidence model to detect indicative guidance from context, and further propose a disentangled regularization to mitigate the over-reliance problem. Experimental results on GLUE and CLUE benchmarks show that TDT gives consistently better results than fine-tuning with different…
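
One way to picture a learnable confidence model over context tokens is a per-token sigmoid gate whose scores weight the token vectors before pooling. The sketch below illustrates only that generic idea of confidence-weighted pooling. It is an assumption made for illustration, not the formulation used in the paper or in the lemon0830/tdt repository, and it does not cover the disentangled regularization. The function names, the gate weights, and the toy vectors are all hypothetical.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def confidence_scores(token_vectors, weight, bias=0.0):
    """Hypothetical gate: one confidence in (0, 1) per token,
    computed as sigmoid(weight . vector + bias)."""
    return [
        sigmoid(sum(w * v for w, v in zip(weight, vec)) + bias)
        for vec in token_vectors
    ]


def confidence_pool(token_vectors, scores):
    """Confidence-weighted average of the token vectors."""
    total = sum(scores)
    dim = len(token_vectors[0])
    return [
        sum(s * vec[d] for s, vec in zip(scores, token_vectors)) / total
        for d in range(dim)
    ]


# Toy 2-dimensional "token representations" (illustrative values only).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Hypothetical gate weights; in a trained model these would be learned.
weight = [2.0, -2.0]

scores = confidence_scores(tokens, weight)
pooled = confidence_pool(tokens, scores)
```

With these toy weights the gate favors tokens whose first coordinate is large, so the pooled vector leans toward the first dimension. In a real model the gate would be trained jointly with the task objective rather than fixed by hand.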

Code & Models

Repositories

lemon0830/tdt (PyTorch, official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification