Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress
Ayomide Odumakinde, Daniel D'souza, Pat Verga, Beyza Ermis, Sara, Hooker

TL;DR
This paper introduces multilingual arbitrage, a method that leverages performance differences among multiple models to improve multilingual data generation, resulting in significant performance gains especially for low-resource languages.
Contribution
The work proposes a novel multilingual arbitrage technique that uses multiple models to enhance data quality and model performance across diverse languages.
Findings
Up to 56.5% improvement in win rates across languages.
Significant gains for low-resource languages.
Outperforms single-teacher approaches.
Abstract
The use of synthetic data has played a critical role in recent state-of-art breakthroughs. However, overly relying on a single oracle teacher model to generate data has been shown to lead to model collapse and invite propagation of biases. These limitations are particularly evident in multilingual settings, where the absence of a universally effective teacher model that excels across all languages presents significant challenges. In this work, we address these extreme difference by introducing "multilingual arbitrage", which capitalizes on performance variations between multiple models for a given language. To do so, we strategically route samples through a diverse pool of models, each with unique strengths in different languages. Across exhaustive experiments on state-of-art models, our work suggests that arbitrage techniques allow for spectacular gains in performance that far…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗CohereLabs/aya-expanse-8bmodel· 16k dl· ♡ 42316k dl♡ 423
- 🤗CohereLabs/aya-expanse-32bmodel· 6.7k dl· ♡ 2896.7k dl♡ 289
- 🤗jth01/aya-expanse-8b-5.0bpw-exl2model· 2 dl2 dl
- 🤗lucyknada/CohereForAI_aya-expanse-8b-exl2model· ♡ 2♡ 2
- 🤗duyntnet/aya-expanse-8b-imatrix-GGUFmodel· 47 dl47 dl
- 🤗lucyknada/CohereForAI_aya-expanse-32b-exl2model· ♡ 2♡ 2
- 🤗Andrewwwwww/aya-expanse-32bmodel· 3 dl3 dl
- 🤗Svngoku/Aya-Expanse-8B-Frenchmodel· 2 dl2 dl
- 🤗QuantFactory/aya-expanse-8b-GGUFmodel· 194 dl· ♡ 5194 dl♡ 5
- 🤗duyntnet/aya-expanse-32b-imatrix-GGUFmodel· 62 dl62 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultilingual Education and Policy
