DeFT-X: Denoised Sparse Fine-Tuning for Zero-Shot Cross-Lingual Transfer
Sona Elza Simon, Preethi Jyothi

TL;DR
DeFT-X introduces a denoising step using singular value decomposition before sparse fine-tuning, significantly improving zero-shot cross-lingual transfer performance for low-resource languages.
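As a rough illustration of what an SVD-based denoising step before sparse selection could look like, the sketch below takes the difference between a fine-tuned and a pretrained weight matrix, keeps only its top singular components, and then picks the largest-magnitude entries as the sparse mask. This is one plausible reading of the summary, not the authors' exact procedure: the helper name `denoised_sparse_mask`, the parameters `rank` and `k`, and the toy matrices are all assumptions made for the example.

```python
import numpy as np

def denoised_sparse_mask(w_pre, w_ft, rank, k):
    """Hypothetical helper: boolean mask over the k largest-magnitude
    entries of a rank-`rank` approximation of (w_ft - w_pre)."""
    delta = w_ft - w_pre
    # Thin SVD of the fine-tuning difference.
    u, s, vt = np.linalg.svd(delta, full_matrices=False)
    # Keep only the top singular components (the assumed "denoising" step).
    low_rank = (u[:, :rank] * s[:rank]) @ vt[:rank, :]
    # Select the indices of the k largest absolute values.
    flat = np.abs(low_rank).ravel()
    top = np.argpartition(flat, -k)[-k:]
    mask = np.zeros(flat.shape, dtype=bool)
    mask[top] = True
    return mask.reshape(delta.shape)

# Toy data (assumed): a rank-1 update plus small noise.
rng = np.random.default_rng(0)
w_pre = rng.normal(size=(8, 6))
signal = np.outer(rng.normal(size=8), rng.normal(size=6))
noise = 0.01 * rng.normal(size=(8, 6))
w_ft = w_pre + signal + noise

mask = denoised_sparse_mask(w_pre, w_ft, rank=1, k=10)
```

In this sketch only the parameters where `mask` is true would be fine-tuned further; choosing `rank` and `k` is left open here because the summary does not specify them.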
Contribution
The paper proposes DeFT-X, a denoised sparse fine-tuning method intended to make sparse masks more robust and effective for zero-shot cross-lingual transfer.
Findings
DeFT-X outperforms existing SFT methods on low-resource language benchmarks.
Denoising improves the robustness of sparse masks in cross-lingual transfer.
DeFT-X achieves comparable or better results than state-of-the-art baselines.
Abstract
Effective cross-lingual transfer remains a critical challenge in scaling the benefits of large language models from high-resource to low-resource languages. Towards this goal, prior studies have explored many approaches to combine task knowledge from task-specific data in a (high-resource) source language and language knowledge from unlabeled text in a (low-resource) target language. One notable approach proposed composable sparse fine-tuning (SFT) for cross-lingual transfer that learns task-specific and language-specific sparse masks to select a subset of the pretrained model's parameters that are further fine-tuned. These sparse fine-tuned vectors (SFTs) are subsequently composed with the pretrained model to facilitate zero-shot cross-lingual transfer to a task in a target language, using only task-specific data from a source language. These sparse masks for SFTs were identified using…
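The composition step described in the abstract can be sketched in a few lines. The example assumes that composing means adding the sparse task and language vectors to the pretrained parameters, which is how composable SFT is usually presented; the function names `sparse_vector` and `compose` and all numeric values are hypothetical and only serve the illustration.

```python
import numpy as np

def sparse_vector(delta, mask):
    """Keep the fine-tuning difference only where the mask is true."""
    return np.where(mask, delta, 0.0)

def compose(theta_pre, task_sft, lang_sft):
    """Assumed composition rule: add both sparse vectors to the
    pretrained parameters."""
    return theta_pre + task_sft + lang_sft

# Toy pretrained parameters (assumed values).
theta_pre = np.array([1.0, 2.0, 3.0, 4.0])

# Task vector learned from labeled source-language data.
task_delta = np.array([0.5, -0.5, 0.5, -0.5])
task_mask = np.array([True, False, False, True])

# Language vector learned from unlabeled target-language text.
lang_delta = np.array([0.1, 0.1, 0.1, 0.1])
lang_mask = np.array([False, True, False, False])

task_sft = sparse_vector(task_delta, task_mask)
lang_sft = sparse_vector(lang_delta, lang_mask)

# Parameters used for zero-shot inference in the target language.
theta = compose(theta_pre, task_sft, lang_sft)
```

Because each vector is zero outside its mask, the task and language vectors can be trained separately and combined afterwards without retraining.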
Taxonomy
Topics: Speech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling
Methods: Shrink and Fine-Tune · Pruning · Sparse Evolutionary Training
