AlignFreeze: Navigating the Impact of Realignment on the Layers of   Multilingual Models Across Diverse Languages

Steve Bakos; F\'elix Gaschi; David Guzm\'an; Riddhi More; Kelly; Chutong Li; En-Shiun Annie Lee

arXiv:2502.12959·cs.CL·February 19, 2025

AlignFreeze: Navigating the Impact of Realignment on the Layers of Multilingual Models Across Diverse Languages

Steve Bakos, F\'elix Gaschi, David Guzm\'an, Riddhi More, Kelly, Chutong Li, En-Shiun Annie Lee

PDF

Open Access 1 Video

TL;DR

AlignFreeze is a novel method that selectively freezes layers during realignment in multilingual models, preventing performance degradation and improving PoS tagging accuracy across diverse languages.

Contribution

The paper introduces AlignFreeze, a new approach that improves cross-lingual transfer by selectively freezing layers during realignment in multilingual models.

Findings

01

Realignment impacts all layers but harms lower layers most.

02

Freezing lower layers prevents performance degradation.

03

AlignFreeze improves PoS tagging accuracy in multiple languages.

Abstract

Realignment techniques are often employed to enhance cross-lingual transfer in multilingual language models, still, they can sometimes degrade performance in languages that differ significantly from the fine-tuned source language. This paper introduces AlignFreeze, a method that freezes either the layers' lower half or upper half during realignment. Through controlled experiments on 4 tasks, 3 models, and in 35 languages, we find that realignment affects all the layers but can be the most detrimental to the lower ones. Freezing the lower layers can prevent performance degradation. Particularly, AlignFreeze improves Part-of-Speech (PoS) tagging performances in languages where full realignment fails: with XLM-R, it provides improvements of more than one standard deviation in accuracy in seven more languages than full realignment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

AlignFreeze: Navigating the Impact of Realignment on the Layers of Multilingual Models Across Diverse Languages· underline

Taxonomy

TopicsNatural Language Processing Techniques