SERA: Soft-Verified Efficient Repository Agents
Ethan Shen, Danny Tormoen, Saurabh Shah, Ali Farhadi, Tim Dettmers

TL;DR
SERA introduces a cost-effective, supervised fine-tuning method for creating specialized open-source coding agents capable of private codebase adaptation, matching state-of-the-art performance with significantly reduced training costs.
Contribution
The paper presents SERA, a novel training approach that enables rapid, cheap, and effective specialization of open-source coding agents to private codebases using supervised fine-tuning.
Findings
SERA achieves state-of-the-art results among open-source models.
Creating SERA is 26x cheaper than reinforcement learning.
Generating synthetic trajectories enables large-scale codebase training.
Abstract
Open-weight coding agents should hold a fundamental advantage over closed-source systems: they can be specialized to private codebases, encoding repository-specific information directly in their weights. Yet the cost and complexity of training has kept this advantage theoretical. We show it is now practical. We present Soft-Verified Efficient Repository Agents (SERA), an efficient method for training coding agents that enables the rapid and cheap creation of agents specialized to private codebases. Using only supervised finetuning (SFT), SERA achieves state-of-the-art results among fully open-source (open data, method, code) models while matching the performance of frontier open-weight models like Devstral-Small-2. Creating SERA models is 26x cheaper than reinforcement learning and 57x cheaper than previous synthetic data methods to reach equivalent performance. Our method, Soft…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗allenai/SERA-32Bmodel· 444 dl· ♡ 110444 dl♡ 110
- 🤗allenai/SERA-32B-GAmodel· 16 dl· ♡ 2116 dl♡ 21
- 🤗allenai/SERA-8B-GAmodel· 13 dl· ♡ 1413 dl♡ 14
- 🤗allenai/SERA-8Bmodel· 428 dl· ♡ 40428 dl♡ 40
- 🤗allenai/SERA-14Bmodel· 243 dl· ♡ 8243 dl♡ 8
- 🤗cyankiwi/SERA-32B-AWQ-4bitmodel· 6 dl6 dl
- 🤗cyankiwi/SERA-32B-AWQ-8bitmodel· 4 dl4 dl
- 🤗ryanfortin/community-blend-qwen3-8bmodel· 7 dl7 dl
- 🤗davanstrien/setfit-hf-dataset-domain-v0model· 29 dl29 dl
- 🤗SunsBp/sera-14b-patchedmodel· 333 dl333 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Software Engineering Research · Ethics and Social Impacts of AI
