TL;DR
This paper presents a comprehensive approach to adapt large language models for the Q programming language, including dataset creation, benchmarking, and training, achieving state-of-the-art accuracy in a specialized domain.
Contribution
It introduces a new dataset, benchmarks models, and develops a full training pipeline for fine-tuning LLMs on the niche Q language, outperforming existing models.
Findings
Best model achieves 59% pass@1 accuracy on Q benchmark.
All models outperform GPT-4.1 on the task.
Models surpass Claude Opus-4 by 29.5% in accuracy.
Abstract
Even though large language models are becoming increasingly capable, it is still unreasonable to expect them to excel at tasks that are under-represented on the Internet. Leveraging LLMs for specialized applications, particularly in niche programming languages and private domains, remains challenging and largely unsolved. In this work, we address this gap by presenting a comprehensive, open-source approach for adapting LLMs to the Q programming language, a popular tool in quantitative finance that is much less present on the Internet compared to Python, C, Java, and other ``mainstream" languages and is therefore not a strong suit of general-purpose AI models. We introduce a new Leetcode style evaluation dataset for Q, benchmark major frontier models on the dataset, then do pretraining, supervised fine tuning, and reinforcement learning to train a suite of reasoning and non-reasoning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗morganstanley/qqWen-1.5B-SFTmodel· 5 dl· ♡ 45 dl♡ 4
- 🤗morganstanley/qqWen-3B-RLmodel· 14 dl· ♡ 214 dl♡ 2
- 🤗morganstanley/qqWen-7B-RLmodel· 2 dl2 dl
- 🤗morganstanley/qqWen-14B-RL-Reasoningmodel· 5 dl· ♡ 25 dl♡ 2
- 🤗morganstanley/qqWen-32B-RL-Reasoningmodel· 6 dl· ♡ 66 dl♡ 6
- 🤗morganstanley/qqWen-32B-Pretrainmodel· 6 dl· ♡ 16 dl♡ 1
- 🤗morganstanley/qqWen-32B-SFTmodel· 14 dl14 dl
- 🤗morganstanley/qqWen-7B-pretrainmodel· 582 dl582 dl
- 🤗morganstanley/qqWen-7B-sftmodel· 85 dl· ♡ 185 dl♡ 1
- 🤗morganstanley/qqWen-14B-sftmodel· 16 dl16 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
