Planning-Aware Code Infilling via Horizon-Length Prediction

Yifeng Ding; Hantian Ding; Shiqi Wang; Qing Sun; Varun Kumar; Zijian Wang

arXiv:2410.03103·cs.LG·November 20, 2025

Planning-Aware Code Infilling via Horizon-Length Prediction

Yifeng Ding, Hantian Ding, Shiqi Wang, Qing Sun, Varun Kumar, Zijian Wang

PDF

Open Access 1 Video

TL;DR

This paper introduces Horizon-Length Prediction (HLP), a novel training method for code infilling models that improves their planning ability and alignment with context, leading to significant performance gains without extra inference costs.

Contribution

HLP is a new training objective that enables models to learn effective planning for code infilling by predicting remaining tokens, improving performance across various benchmarks.

Findings

01

HLP improves code infilling performance by up to 24%

02

Models with HLP show enhanced code reasoning capabilities

03

HLP adds negligible training overhead and no inference cost

Abstract

Fill-in-the-Middle (FIM), or infilling, has become integral to code language models, enabling generation of missing code given both left and right contexts. However, the current FIM training paradigm which performs next-token prediction (NTP) over reordered sequence often leads to models struggling to generate content that aligns well with the surrounding context. We hypothesize that NTP alone is insufficient for models to learn effective planning conditioned on the distant right context, a critical factor for successful code infilling. To overcome this, we propose Horizon-Length Prediction (HLP), a novel training objective that teaches models to predict the number of remaining middle tokens at each step. HLP advances FIM with lookahead planning, enabling models to inherently learn infilling boundaries for arbitrary left and right contexts without relying on dataset-specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Planning-Aware Code Infilling via Horizon-Length Prediction· underline

Taxonomy

TopicsModel-Driven Software Engineering Techniques · Manufacturing Process and Optimization · Advanced Measurement and Metrology Techniques