Sequence Model Design for Code Completion in the Modern IDE

Gareth Ari Aye; Gail E. Kaiser

arXiv:2004.05249·cs.SE·April 14, 2020·20 cites

Sequence Model Design for Code Completion in the Modern IDE

Gareth Ari Aye, Gail E. Kaiser

PDF

Open Access

TL;DR

This paper introduces a novel neural language model for code completion that ensures valid, typecheckable suggestions, delivers instant results, and is optimized for deployment in modern IDEs, outperforming previous models in accuracy.

Contribution

The paper presents a new model combining static analysis with neural predictions, addressing practical constraints of IDEs and improving code completion accuracy and efficiency.

Findings

01

Achieves state-of-the-art accuracy in source code modeling

02

Ensures suggestions are always valid and typecheckable

03

Fits within the resource constraints of modern IDEs

Abstract

Code completion plays a prominent role in modern integrated development environments (IDEs). Machine learning has become ubiquitous in analogous natural language writing and search software, surfacing more relevant autocompletions and search suggestions in fewer keystrokes. Prior research has reported training high-accuracy, deep neural networks for modeling source code, but little attention has been given to the practical constraints imposed by interactive developer tools. In particular, neural language models for source code modeling like the one described in Maybe Deep Neural Networks are the Best Choice for Modeling Source Code are framed around code completion, but only report accuracy of next-token prediction. However, in order for a language model (LM) to work well within real-world code completion systems, it must also always make suggestions that produce valid code that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Software System Performance and Reliability