WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models

Huawen Feng; Pu Zhao; Qingfeng Sun; Can Xu; Fangkai Yang; Lu Wang; Qianli Ma; Qingwei Lin; Saravan Rajmohan; Dongmei Zhang; Qi Zhang

arXiv:2412.17395·cs.CL·February 19, 2025

TL;DR

WarriorCoder introduces a competitive framework where expert code LLMs challenge each other to generate diverse training data, leading to state-of-the-art performance without relying on proprietary models.

Contribution

The paper presents a novel paradigm that uses expert battles among code LLMs to create diverse training data, reducing reliance on proprietary models and the systemic biases they can introduce.

Findings

01. Achieves state-of-the-art performance among models of the same size.

02. Generates diverse training data from scratch through expert battles.

03. Does not rely on proprietary LLMs for data augmentation.

Abstract

Despite recent progress achieved by code large language models (LLMs), their remarkable abilities are largely dependent on fine-tuning on high-quality data, posing challenges for data collection and annotation. To address this, current methods often design various data flywheels to collect complex code instructions, enabling models to handle more intricate tasks. However, these approaches typically rely on off-the-shelf datasets and data augmentation from a limited set of proprietary LLMs (e.g., Claude, GPT4, and so on), which restricts the diversity of the constructed data and makes it prone to systemic biases. In this paper, we propose WarriorCoder, a novel paradigm that learns from expert battles to address these limitations. Specifically, we create an arena where leading expert code LLMs challenge each other, with evaluations conducted by impartial judges. This competitive framework…
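The arena idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the names `run_battle`, `TrainingExample`, and `demo`, the toy experts, and the rule "average the judges' scores and keep the winner's response" are all assumptions made for illustration. Real experts and judges would be LLM calls rather than the stub functions used here.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Dict, List

# Hypothetical interfaces (assumptions, not from the paper):
# an expert maps an instruction to a response,
# a judge maps (instruction, response) to a numeric score.
Expert = Callable[[str], str]
Judge = Callable[[str, str], float]


@dataclass
class TrainingExample:
    instruction: str
    response: str
    winner: str


def run_battle(
    instruction: str,
    competitors: Dict[str, Expert],
    judges: List[Judge],
) -> TrainingExample:
    """Let every competitor answer, score each answer with all judges,
    and keep the highest-scoring answer as a training example.
    Requires at least one judge."""
    responses = {name: expert(instruction) for name, expert in competitors.items()}
    scores = {
        name: mean(judge(instruction, resp) for judge in judges)
        for name, resp in responses.items()
    }
    # Ties are broken by insertion order of `competitors`.
    winner = max(scores, key=scores.get)
    return TrainingExample(instruction, responses[winner], winner)


def demo() -> TrainingExample:
    # Toy stand-ins for expert code LLMs (assumption: real ones are model calls).
    competitors: Dict[str, Expert] = {
        "expert_a": lambda _: "def add(a, b):\n    return a + b",
        "expert_b": lambda _: "def add(a, b):\n    pass",
    }
    # Toy stand-ins for impartial judges.
    judges: List[Judge] = [
        lambda _, resp: 1.0 if "return" in resp else 0.0,
        lambda _, resp: 1.0 if resp.startswith("def ") else 0.0,
    ]
    return run_battle("Write a function that adds two numbers.", competitors, judges)


if __name__ == "__main__":
    example = demo()
    print(example.winner)
    print(example.response)
```

Averaging judge scores is only one possible aggregation; a voting scheme or a rating system would slot into the same place where `scores` is computed.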

Code & Models

Models

🤗 HuggingMicah/warriorcoder_reproduce (model · 2 downloads · 7 likes)

Videos

WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models · Underline

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education