Optimizing Large Language Models for OpenAPI Code Completion

Bohdan Petryshyn; Mantas Luko\v{s}evi\v{c}ius

arXiv:2405.15729·cs.SE·June 12, 2024

Optimizing Large Language Models for OpenAPI Code Completion

Bohdan Petryshyn, Mantas Luko\v{s}evi\v{c}ius

PDF

Open Access 2 Repos 1 Datasets

TL;DR

This paper evaluates and improves OpenAPI code completion in large language models by fine-tuning open-source models and proposing a new benchmark, achieving significant accuracy gains over commercial tools with fewer parameters.

Contribution

It introduces task-specific optimizations for open-source LLMs, a new semantics-aware OpenAPI benchmark, and enhancements to code infilling techniques for better performance with smaller context sizes.

Findings

01

Fine-tuned Code Llama improves correctness by 55.2% over GitHub Copilot.

02

The fine-tuned model uses 25 times fewer parameters than Codex.

03

Proposed enhancements address context size underperformance.

Abstract

Recent advancements in Large Language Models (LLMs) and their utilization in code generation tasks have significantly reshaped the field of software development. Despite the remarkable efficacy of code completion solutions in mainstream programming languages, their performance lags when applied to less ubiquitous formats such as OpenAPI definitions. This study evaluates the OpenAPI completion performance of GitHub Copilot, a prevalent commercial code completion tool, and proposes a set of task-specific optimizations leveraging Meta's open-source model Code Llama. A semantics-aware OpenAPI completion benchmark proposed in this research is used to perform a series of experiments through which the impact of various prompt-engineering and fine-tuning techniques on the Code Llama model's performance is analyzed. The fine-tuned Code Llama model reaches a peak correctness improvement of 55.2%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

BohdanPetryshyn/openapi-completion-refined
dataset· 54 dl
54 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsSparse Evolutionary Training · LLaMA