Improving Multi-lingual Alignment Through Soft Contrastive Learning

Minsu Park; Seyeon Choi; Chanyeol Choi; Jun-Seong Kim; Jy-yong Sohn

arXiv:2405.16155·cs.CL·May 29, 2024

Improving Multi-lingual Alignment Through Soft Contrastive Learning

Minsu Park, Seyeon Choi, Chanyeol Choi, Jun-Seong Kim, Jy-yong Sohn

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel soft contrastive learning method for multi-lingual sentence embeddings, leveraging a pre-trained mono-lingual model to improve alignment and outperform existing models in cross-lingual tasks.

Contribution

It proposes a new soft contrastive loss approach for multi-lingual embedding alignment using sentence similarity from mono-lingual models, enhancing performance over traditional methods.

Findings

01

Outperforms conventional contrastive loss with hard labels.

02

Achieves superior results on bitext mining and STS benchmarks.

03

Outperforms LaBSE on Tatoeba dataset.

Abstract

Making decent multi-lingual sentence representations is critical to achieve high performances in cross-lingual downstream tasks. In this work, we propose a novel method to align multi-lingual embeddings based on the similarity of sentences measured by a pre-trained mono-lingual embedding model. Given translation sentence pairs, we train a multi-lingual model in a way that the similarity between cross-lingual embeddings follows the similarity of sentences measured at the mono-lingual teacher model. Our method can be considered as contrastive learning with soft labels defined as the similarity between sentences. Our experimental results on five languages show that our contrastive loss with soft labels far outperforms conventional contrastive loss with hard labels in various benchmarks for bitext mining tasks and STS tasks. In addition, our method outperforms existing multi-lingual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yai12xlinq-b/imascl
pytorchOfficial

Videos

Improving Multi-lingual Alignment Through Soft Contrastive Learning· underline

Taxonomy

TopicsSocioeconomic Development in MENA

MethodsALIGN · Contrastive Learning