Better Context Makes Better Code Language Models: A Case Study on   Function Call Argument Completion

Hengzhi Pei; Jinman Zhao; Leonard Lausen; Sheng Zha; George Karypis

arXiv:2306.00381·cs.SE·June 2, 2023·2 cites

Better Context Makes Better Code Language Models: A Case Study on Function Call Argument Completion

Hengzhi Pei, Jinman Zhao, Leonard Lausen, Sheng Zha, George Karypis

PDF

Open Access 1 Video

TL;DR

This paper demonstrates that incorporating non-local code context via program analyzers significantly improves function call argument completion in pretrained code language models, addressing limitations of local-context-only approaches.

Contribution

It introduces a new dataset with full project context and external dependencies, and explores methods to integrate program analyzer information into code completion models.

Findings

01

Providing access to function implementation improves argument prediction.

02

Using function usage data enhances model performance.

03

Different methods of integrating analyzer info affect results.

Abstract

Pretrained code language models have enabled great progress towards program synthesis. However, common approaches only consider in-file local context and thus miss information and constraints imposed by other parts of the codebase and its external dependencies. Existing code completion benchmarks also lack such context. To resolve these restrictions we curate a new dataset of permissively licensed Python packages that includes full projects and their dependencies and provide tools to extract non-local information with the help of program analyzers. We then focus on the task of function call argument completion which requires predicting the arguments to function calls. We show that existing code completion models do not yield good results on our completion task. To better solve this task, we query a program analyzer for information relevant to a given function call, and consider ways to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Better Context Makes Better Code Language Models: A Case Study on Function Call Argument Completion· underline

Taxonomy

TopicsSoftware Engineering Research · Computational Physics and Python Applications · Software System Performance and Reliability

MethodsFocus