Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment

Aleksander Boruch-Gruszecki; Yangtian Zi; Zixuan Wu; Tejas Oberoi; Carolyn Jane Anderson; Joydeep Biswas; Arjun Guha

arXiv:2508.04865·cs.LG·March 24, 2026

Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment

Aleksander Boruch-Gruszecki, Yangtian Zi, Zixuan Wu, Tejas Oberoi, Carolyn Jane Anderson, Joydeep Biswas, Arjun Guha

PDF

TL;DR

Agnostics introduces a universal reinforcement learning pipeline that enables effective code generation in any programming language by focusing on observable behavior, reducing language-specific engineering efforts.

Contribution

It presents a language-agnostic post-training method using verifiable rewards, applicable across diverse models and languages, with publicly released datasets and configurations.

Findings

01

Achieves performance comparable to larger models in low-resource languages.

02

Scales effectively to larger and diverse model families.

03

Sets new state-of-the-art pass@1 results on multi-language benchmarks.

Abstract

Large language models (LLMs) already excel at writing code in high-resource languages such as Python and JavaScript, yet stumble on low-resource languages that remain essential to science and engineering. Besides the obvious shortage of pre-training data, post-training itself is a bottleneck: every new language seems to require new datasets, test harnesses, and reinforcement-learning (RL) infrastructure. We introduce Agnostics, a language-agnostic post-training pipeline that eliminates this per-language engineering. The key idea is to judge code solely by its externally observable behavior, so a single verifier can test solutions written in any language. Concretely, we (i) use an LLM to rewrite existing unit-test datasets into an I/O format, (ii) supply a short configuration that tells the verifier how to compile and run a target language, and (iii) apply reinforcement learning with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.