A Survey of Large Language Models for Code: Evolution, Benchmarking, and Future Trends
Zibin Zheng, Kaiwen Ning, Yanlin Wang, Jingwen Zhang, Dewu, Zheng, Mingxi Ye, Jiachi Chen

TL;DR
This survey comprehensively analyzes the evolution, benchmarking, and future trends of large language models specialized for code, highlighting their performance, relationships, and development directions in software engineering tasks.
Contribution
It systematically categorizes Code LLMs, compares their performance with general LLMs, and offers insights for future development and application in software engineering.
Findings
Code LLMs are often derived from general LLMs through fine-tuning.
Performance of Code LLMs varies across different software engineering tasks.
Certain LLMs outperform others on specific benchmarks and tasks.
Abstract
General large language models (LLMs), represented by ChatGPT, have demonstrated significant potential in tasks such as code generation in software engineering. This has led to the development of specialized LLMs for software engineering, known as Code LLMs. A considerable portion of Code LLMs is derived from general LLMs through model fine-tuning. As a result, Code LLMs are often updated frequently and their performance can be influenced by the base LLMs. However, there is currently a lack of systematic investigation into Code LLMs and their performance. In this study, we conduct a comprehensive survey and analysis of the types of Code LLMs and their differences in performance compared to general LLMs. We aim to address three questions: (1) What LLMs are specifically designed for software engineering tasks, and what is the relationship between these Code LLMs? (2) Do Code LLMs really…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Software Engineering Research · Ferroelectric and Negative Capacitance Devices
