Exploring Alignment in Shared Cross-lingual Spaces

Basel Mousi; Nadir Durrani; Fahim Dalvi; Majd Hawasly and; Ahmed Abdelali

arXiv:2405.14535·cs.CL·May 24, 2024

Exploring Alignment in Shared Cross-lingual Spaces

Basel Mousi, Nadir Durrani, Fahim Dalvi, Majd Hawasly and, Ahmed Abdelali

PDF

Open Access 1 Repo 2 Videos

TL;DR

This paper investigates how multilingual models align concepts across languages in their latent spaces, introducing metrics to quantify this alignment and analyzing models like mT5, mBERT, and XLM-R across various tasks.

Contribution

It introduces two metrics, extit{CA} and extit{CO}, to measure concept alignment and overlap in multilingual embeddings, and analyzes how these evolve with model depth and fine-tuning.

Findings

01

Deeper layers show increased cross-lingual alignment.

02

Fine-tuning enhances alignment in latent space.

03

Task-specific calibration explains zero-shot capabilities.

Abstract

Despite their remarkable ability to capture linguistic nuances across diverse languages, questions persist regarding the degree of alignment between languages in multilingual embeddings. Drawing inspiration from research on high-dimensional representations in neural language models, we employ clustering to uncover latent concepts within multilingual models. Our analysis focuses on quantifying the \textit{alignment} and \textit{overlap} of these concepts across various languages within the latent space. To this end, we introduce two metrics \CA{} and \CO{} aimed at quantifying these aspects, enabling a deeper exploration of multilingual embeddings. Our study encompasses three multilingual models (\texttt{mT5}, \texttt{mBERT}, and \texttt{XLM-R}) and three downstream tasks (Machine Translation, Named Entity Recognition, and Sentiment Analysis). Key findings from our analysis include: i)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qcri/multilingual-latent-concepts
noneOfficial

Videos

Exploring Alignment in Shared Cross-lingual Spaces· underline

Taxonomy

TopicsSpeech and dialogue systems