Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment
Yongxin Huang, Kexin Wang, Goran Glava\v{s}, Iryna Gurevych

TL;DR
This paper proposes a modular training approach for multilingual sentence encoders that separates language-specific monolingual modules from cross-lingual alignment adapters, improving performance and balance across tasks and languages.
Contribution
It introduces a modular training framework that mitigates multilinguality issues by decoupling monolingual and cross-lingual training, especially benefiting low-resource languages.
Findings
Improved performance on semantic similarity and relatedness tasks.
Enhanced zero-shot transfer capabilities for sentence classification.
Better balance across monolingual and cross-lingual tasks.
Abstract
Multilingual sentence encoders (MSEs) are commonly obtained by training multilingual language models to map sentences from different languages into a shared semantic space. As such, they are subject to curse of multilinguality, a loss of monolingual representational accuracy due to parameter sharing. Another limitation of MSEs is the trade-off between different task performance: cross-lingual alignment training distorts the optimal monolingual structure of semantic spaces of individual languages, harming the utility of sentence embeddings in monolingual tasks; cross-lingual tasks, such as cross-lingual semantic similarity and zero-shot transfer for sentence classification, may also require conflicting cross-lingual alignment strategies. In this work, we address both issues by means of modular training of sentence encoders. We first train language-specific monolingual modules to mitigate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsAdapter · ALIGN · Contrastive Learning
