Technical Report: Full-Stack Fine-Tuning for the Q Programming Language

Brendan R. Hogan; Will Brown; Adel Boyarsky; Anderson Schneider; Yuriy Nevmyvaka

arXiv:2508.06813 · cs.LG · August 18, 2025

TL;DR

This paper presents a comprehensive, open-source approach to adapting large language models to the Q programming language, covering dataset creation, benchmarking, and training, and reports state-of-the-art accuracy on its Q benchmark.

Contribution

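The paper introduces a new LeetCode-style evaluation dataset for Q, benchmarks major frontier models on it, and builds a full training pipeline (pretraining, supervised fine-tuning, and reinforcement learning) for the niche Q language, producing models that outperform the frontier models evaluated.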

Findings

01

The best model achieves 59% pass@1 accuracy on the Q benchmark.

02

All of the paper's trained models outperform GPT-4.1 on the task.

03

The best model surpasses Claude Opus-4 by 29.5% in pass@1 accuracy.
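
For readers unfamiliar with the pass@1 metric used in the findings, the following is a minimal sketch of how pass@k is commonly computed for code benchmarks, using the standard unbiased estimator 1 - C(n-c, k)/C(n, k), which reduces to c/n when k = 1. Whether the paper uses exactly this estimator is an assumption here; the function names and the toy results are hypothetical and are not taken from the paper.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: number of samples generated for the problem
    c: number of those samples that pass all tests
    k: sample budget being evaluated (k=1 gives pass@1)
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    if n - c < k:
        # Fewer than k failing samples: every size-k draw contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: list[tuple[int, int]], k: int = 1) -> float:
    """Average the per-problem pass@k over a benchmark.

    results: one (n, c) pair per problem.
    """
    if not results:
        raise ValueError("results must not be empty")
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Hypothetical toy data: four problems, four samples each.
toy_results = [(4, 4), (4, 2), (4, 0), (4, 1)]
score = benchmark_pass_at_k(toy_results, k=1)
print(f"pass@1 = {score:.4f}")
```

With a single sample per problem (n = 1), pass@1 is simply the fraction of problems solved on the first attempt.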

Abstract

Even though large language models are becoming increasingly capable, it is still unreasonable to expect them to excel at tasks that are under-represented on the Internet. Leveraging LLMs for specialized applications, particularly in niche programming languages and private domains, remains challenging and largely unsolved. In this work, we address this gap by presenting a comprehensive, open-source approach for adapting LLMs to the Q programming language, a popular tool in quantitative finance that is much less present on the Internet than Python, C, Java, and other "mainstream" languages and is therefore not a strong suit of general-purpose AI models. We introduce a new LeetCode-style evaluation dataset for Q, benchmark major frontier models on the dataset, then do pretraining, supervised fine-tuning, and reinforcement learning to train a suite of reasoning and non-reasoning…
