Comparative Analysis of Large Language Models for Context-Aware Code   Completion using SAFIM Framework

Hang Zhang; Yanxin Shen; Lun Wang; Chuanqi Shi; Shaoshuai Du; Yiyi; Tao; Yixian Shen

arXiv:2502.15243·cs.SE·February 24, 2025

Comparative Analysis of Large Language Models for Context-Aware Code Completion using SAFIM Framework

Hang Zhang, Yanxin Shen, Lun Wang, Chuanqi Shi, Shaoshuai Du, Yiyi, Tao, Yixian Shen

PDF

TL;DR

This paper compares the performance of various large language models in syntax-aware code completion using the SAFIM benchmark, highlighting differences in accuracy and efficiency to guide future development.

Contribution

It introduces a comprehensive benchmark for evaluating LLMs in syntax-sensitive code completion and provides a detailed comparative analysis of multiple models.

Findings

01

Significant performance differences among models.

02

Trade-offs between accuracy and latency.

03

Benchmark establishes a standard for future research.

Abstract

The advent of Large Language Models (LLMs) has revolutionized code completion, transforming it into a more intelligent and context-aware feature in modern integrated development environments. These advancements have significantly enhanced developers' ability to write efficient and error-free code. This study evaluates the performance of several chat-based LLMs, including Gemini 1.5 Flash, Gemini 1.5 Pro, GPT-4o, GPT-4o-mini, and GPT-4 Turbo, using the Syntax-Aware Fill-in-the-Middle (SAFIM) dataset. This benchmark is specifically designed to assess models' capabilities in syntax-sensitive code generation. Performance metrics, such as cosine similarity with ground-truth completions and latency, were employed to measure both accuracy and efficiency. The findings reveal substantial differences in the models' code completion abilities, offering valuable insights into their respective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Absolute Position Encodings · Linear Layer · Layer Normalization · Byte Pair Encoding · Dense Connections · Residual Connection · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer