Sequence Model Design for Code Completion in the Modern IDE
Gareth Ari Aye, Gail E. Kaiser

TL;DR
This paper introduces a novel neural language model for code completion that ensures valid, typecheckable suggestions, delivers instant results, and is optimized for deployment in modern IDEs, outperforming previous models in accuracy.
Contribution
The paper presents a new model combining static analysis with neural predictions, addressing practical constraints of IDEs and improving code completion accuracy and efficiency.
Findings
Achieves state-of-the-art accuracy in source code modeling
Ensures suggestions are always valid and typecheckable
Fits within the resource constraints of modern IDEs
Abstract
Code completion plays a prominent role in modern integrated development environments (IDEs). Machine learning has become ubiquitous in analogous natural language writing and search software, surfacing more relevant autocompletions and search suggestions in fewer keystrokes. Prior research has reported training high-accuracy, deep neural networks for modeling source code, but little attention has been given to the practical constraints imposed by interactive developer tools. In particular, neural language models for source code modeling like the one described in Maybe Deep Neural Networks are the Best Choice for Modeling Source Code are framed around code completion, but only report accuracy of next-token prediction. However, in order for a language model (LM) to work well within real-world code completion systems, it must also always make suggestions that produce valid code that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Software System Performance and Reliability
