Improving Cross-lingual Information Retrieval on Low-Resource Languages   via Optimal Transport Distillation

Zhiqi Huang; Puxuan Yu; James Allan

arXiv:2301.12566·cs.CL·January 31, 2023

Improving Cross-lingual Information Retrieval on Low-Resource Languages via Optimal Transport Distillation

Zhiqi Huang, Puxuan Yu, James Allan

PDF

TL;DR

This paper introduces OPTICAL, a novel optimal transport-based distillation method that improves cross-lingual retrieval for low-resource languages by leveraging monolingual models and minimal data.

Contribution

The paper proposes a new distillation approach using optimal transport to transfer knowledge from high-resource to low-resource languages in cross-lingual retrieval.

Findings

01

OPTICAL outperforms strong baselines on low-resource languages.

02

The method requires only minimal bitext data for training.

03

Significant improvements are achieved in low-resource language retrieval tasks.

Abstract

Benefiting from transformer-based pre-trained language models, neural ranking models have made significant progress. More recently, the advent of multilingual pre-trained language models provides great support for designing neural cross-lingual retrieval models. However, due to unbalanced pre-training data in different languages, multilingual language models have already shown a performance gap between high and low-resource languages in many downstream tasks. And cross-lingual retrieval models built on such pre-trained models can inherit language bias, leading to suboptimal result for low-resource languages. Moreover, unlike the English-to-English retrieval task, where large-scale training collections for document ranking such as MS MARCO are available, the lack of cross-lingual retrieval data for low-resource language makes it more challenging for training cross-lingual retrieval…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.