compar:IA: The French Government's LLM arena to collect French-language human prompts and preference data

Lucie Termignon; Simonas Zilinskas; Hadrien Pélissier; Aurélien Barrot; Nicolas Chesnais; Elie Gavoty

arXiv:2602.06669·cs.CL·February 9, 2026

Open Access

TL;DR

The paper introduces compar:IA, a French government-developed platform that collects large-scale French-language human preference data to improve multilingual LLMs, with open datasets and initial analyses.

Contribution

It presents an open-source infrastructure for collecting and analyzing French-language human preferences, addressing data scarcity in non-English LLM training.

Findings

01. Collected over 600,000 prompts and 250,000 preference votes.

02. Released open datasets including conversations, votes, and reactions.

03. Developed a French-language model leaderboard and analyzed user interaction patterns.
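
To make the link between pairwise votes and a leaderboard concrete, here is a minimal sketch that fits Bradley-Terry strengths from (winner, loser) pairs. This summary does not say which ranking method the paper uses, so Bradley-Terry is an assumption chosen because it is a common model for pairwise comparison data. The function names, the vote format, and the model names are all hypothetical, and ties or "both bad" votes are not handled.

```python
from collections import defaultdict


def bradley_terry(votes, iterations=200, tol=1e-9):
    """Fit Bradley-Terry strengths from (winner, loser) pairs.

    Uses the classic minorization-maximization update. Assumes every
    model has at least one win; ties are not modeled in this sketch.
    """
    wins = defaultdict(float)         # total wins per model
    pair_counts = defaultdict(float)  # comparisons per unordered pair
    models = set()
    for winner, loser in votes:
        if winner == loser:
            continue  # ignore malformed self-comparisons
        wins[winner] += 1.0
        pair_counts[frozenset((winner, loser))] += 1.0
        models.update((winner, loser))

    strengths = {m: 1.0 for m in models}
    for _ in range(iterations):
        updated = {}
        for m in models:
            denom = 0.0
            for other in models:
                if other == m:
                    continue
                n = pair_counts.get(frozenset((m, other)), 0.0)
                if n:
                    denom += n / (strengths[m] + strengths[other])
            updated[m] = wins[m] / denom if denom else strengths[m]
        # Rescale so strengths sum to the number of models.
        total = sum(updated.values())
        updated = {m: s * len(models) / total for m, s in updated.items()}
        change = max(abs(updated[m] - strengths[m]) for m in models)
        strengths = updated
        if change < tol:
            break
    return strengths


def leaderboard(votes):
    """Return model names sorted from strongest to weakest."""
    strengths = bradley_terry(votes)
    return sorted(strengths, key=strengths.get, reverse=True)


# Hypothetical votes: each tuple is (preferred model, other model).
example_votes = (
    [("model-a", "model-b")] * 3 + [("model-b", "model-a")]
    + [("model-b", "model-c")] * 3 + [("model-c", "model-b")]
    + [("model-a", "model-c")] * 3 + [("model-c", "model-a")]
)
print(leaderboard(example_votes))
```

A production leaderboard would likely also need confidence intervals and a policy for models with very few votes, neither of which this sketch attempts.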

Abstract

Large Language Models (LLMs) often show reduced performance, cultural alignment, and safety robustness in non-English languages, partly because English dominates both pre-training data and human preference alignment datasets. Training methods like Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO) require human preference data, which remains scarce and largely non-public for many languages beyond English. To address this gap, we introduce compar:IA, an open-source digital public service developed inside the French government and designed to collect large-scale human preference data from a predominantly French-speaking general audience. The platform uses a blind pairwise comparison interface to capture unconstrained, real-world prompts and user judgments across a diverse set of language models, while maintaining low participation friction and…

Taxonomy

Topics: Mobile Crowdsensing and Crowdsourcing · Multimodal Machine Learning Applications · Ethics and Social Impacts of AI