Steerable Instruction Following Coding Data Synthesis with Actor-Parametric Schema Co-Evolution
Tinglin Huang, Bo Chen, Xiao Zhang, Kai Shen, Rex Ying

TL;DR
This paper introduces IFCodeEvolve, a novel framework for generating instruction-following coding data using actor-schema co-evolution, significantly improving model performance and providing a new benchmark.
Contribution
The paper presents a new actor-schema co-evolution approach for synthesizing instruction-paired coding data and introduces IFCodeBench, a verified benchmark for evaluation.
Findings
IFCodeEvolve boosts base model performance to SOTA levels.
The framework effectively navigates large instruction spaces with MCTS.
IFCodeEvolve's data synthesis improves instruction-following capabilities.
Abstract
Interpreting and following human instructions is a critical capability of large language models (LLMs) in automatic programming. However, synthesizing large-scale instruction-paired coding data remains largely unexplored and is particularly challenging when ensuring logical compatibility among multiple constraints. In this study, we propose IFCodeEvolve, an actor-schema co-evolution framework for instruction following coding data generation. By representing instructions as parametric function schema, we construct a library that covers the vast instruction space via dynamic constraint instantiation. Building upon this, Monte Carlo Tree Search (MCTS) sampler is applied to efficiently navigate this space, utilizing actor model feedback as a dynamic termination signal. Furthermore, to progressively explore challenging problems, we introduce a co-evolving paradigm that iteratively advances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
