A Survey of Large Language Models for Code: Evolution, Benchmarking, and   Future Trends

Zibin Zheng; Kaiwen Ning; Yanlin Wang; Jingwen Zhang; Dewu; Zheng; Mingxi Ye; Jiachi Chen

arXiv:2311.10372·cs.SE·January 9, 2024·20 cites

A Survey of Large Language Models for Code: Evolution, Benchmarking, and Future Trends

Zibin Zheng, Kaiwen Ning, Yanlin Wang, Jingwen Zhang, Dewu, Zheng, Mingxi Ye, Jiachi Chen

PDF

Open Access

TL;DR

This survey comprehensively analyzes the evolution, benchmarking, and future trends of large language models specialized for code, highlighting their performance, relationships, and development directions in software engineering tasks.

Contribution

It systematically categorizes Code LLMs, compares their performance with general LLMs, and offers insights for future development and application in software engineering.

Findings

01

Code LLMs are often derived from general LLMs through fine-tuning.

02

Performance of Code LLMs varies across different software engineering tasks.

03

Certain LLMs outperform others on specific benchmarks and tasks.

Abstract

General large language models (LLMs), represented by ChatGPT, have demonstrated significant potential in tasks such as code generation in software engineering. This has led to the development of specialized LLMs for software engineering, known as Code LLMs. A considerable portion of Code LLMs is derived from general LLMs through model fine-tuning. As a result, Code LLMs are often updated frequently and their performance can be influenced by the base LLMs. However, there is currently a lack of systematic investigation into Code LLMs and their performance. In this study, we conduct a comprehensive survey and analysis of the types of Code LLMs and their differences in performance compared to general LLMs. We aim to address three questions: (1) What LLMs are specifically designed for software engineering tasks, and what is the relationship between these Code LLMs? (2) Do Code LLMs really…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Software Engineering Research · Ferroelectric and Negative Capacitance Devices