Improving FIM Code Completions via Context & Curriculum Based Learning
Hitesh Sagtani, Rishabh Mehrotra, Beyang Liu

TL;DR
This paper enhances Fill-in-the-Middle code completion models by incorporating curriculum learning with hard-to-complete examples and context analysis, improving performance especially for smaller models under latency constraints.
Contribution
It introduces a curriculum-based training approach using hard patterns and context analysis to improve FIM code completion, balancing accuracy and latency.
Findings
Fine-tuned models outperform baseline in code completion tasks.
Curriculum learning enhances performance more for smaller models.
No latency impact observed during online testing.
Abstract
Fill-in-the-Middle (FIM) models play a vital role in code completion tasks, leveraging both prefix and suffix context to provide more accurate and contextually relevant suggestions. This paper presents approaches to improve FIM code completion while addressing the challenge of maintaining low latency for real-time coding assistance. We enhance FIM code completion by incorporating context and curriculum examples in the training process. We identify patterns where completion suggestions fail more frequently, revealing complexities that smaller language models struggle with. To address these challenges, we develop a curriculum dataset by extracting hard-to-complete patterns from code repositories and generate context examples using semantic and static analysis tools (e.g. TSC compiler). We fine-tune various sized models, including StarCoder and DeepSeek, on this enhanced dataset. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpen Education and E-Learning
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Cosine Annealing · Residual Connection · Adam · Weight Decay · Linear Warmup With Cosine Annealing · Layer Normalization · Discriminative Fine-Tuning · Linear Layer · Dropout
