Soft Alignment Objectives for Robust Adaptation of Language Generation

Michal \v{S}tef\'anik; Marek Kadl\v{c}\'ik; Petr Sojka

arXiv:2211.16550·cs.CL·May 29, 2023

Soft Alignment Objectives for Robust Adaptation of Language Generation

Michal \v{S}tef\'anik, Marek Kadl\v{c}\'ik, Petr Sojka

PDF

Open Access 1 Repo

TL;DR

This paper proposes novel soft alignment training objectives for domain adaptation in language models, improving robustness and reducing catastrophic forgetting without significant computational overhead.

Contribution

Introduces semantic similarity-based training objectives that mitigate forgetting during domain adaptation while maintaining model quality and efficiency.

Findings

01

Mitigates catastrophic forgetting in domain adaptation.

02

Preserves language model quality during adaptation.

03

Adds negligible computational costs.

Abstract

Domain adaptation allows generative language models to address specific flaws caused by the domain shift of their application. However, the traditional adaptation by further training on in-domain data rapidly weakens the model's ability to generalize to other domains, making the open-ended deployments of the adapted models prone to errors. This work introduces novel training objectives built upon a semantic similarity of the predicted tokens to the reference. Our results show that (1) avoiding the common assumption of a single correct prediction by constructing the training target from tokens' semantic similarity can mitigate catastrophic forgetting during domain adaptation, while (2) preserving the quality of the adaptation, (3) with negligible additions to compute costs. In the broader context, the objectives grounded in a continuous token similarity pioneer the exploration of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mir-mu/softalign_objectives
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning