compar:IA: The French Government's LLM arena to collect French-language human prompts and preference data
Lucie Termignon, Simonas Zilinskas, Hadrien P\'elissier, Aur\'elien Barrot, Nicolas Chesnais, Elie Gavoty

TL;DR
The paper introduces compar:IA, a French government-developed platform that collects large-scale French-language human preference data to improve multilingual LLMs, with open datasets and initial analyses.
Contribution
It presents an open-source infrastructure for collecting and analyzing French-language human preferences, addressing data scarcity in non-English LLM training.
Findings
Collected over 600,000 prompts and 250,000 preference votes
Released open datasets including conversations, votes, and reactions
Developed a French-language model leaderboard and analyzed user interaction patterns
Abstract
Large Language Models (LLMs) often show reduced performance, cultural alignment, and safety robustness in non-English languages, partly because English dominates both pre-training data and human preference alignment datasets. Training methods like Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO) require human preference data, which remains scarce and largely non-public for many languages beyond English. To address this gap, we introduce compar:IA, an open-source digital public service developed inside the French government and designed to collect large-scale human preference data from a predominantly French-speaking general audience. The platform uses a blind pairwise comparison interface to capture unconstrained, real-world prompts and user judgments across a diverse set of language models, while maintaining low participation friction and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Multimodal Machine Learning Applications · Ethics and Social Impacts of AI
