A Survey on LLM-based Code Generation for Low-Resource and Domain-Specific Programming Languages
Sathvik Joel, Jie JW Wu, Fatemeh H. Fard

TL;DR
This survey reviews the current state, challenges, and methodologies of using Large Language Models for code generation in low-resource and domain-specific programming languages, highlighting gaps and future directions.
Contribution
It systematically analyzes existing research, categorizes improvement strategies, and identifies the lack of standard benchmarks for LLM-based code generation in LRPLs and DSLs.
Findings
Identified four main evaluation techniques for code generation.
Categorized six groups of improvement methods.
Highlighted the absence of standard benchmarks and datasets.
Abstract
Large Language Models (LLMs) have shown impressive capabilities in code generation for popular programming languages. However, their performance on Low-Resource Programming Languages (LRPLs) and Domain-Specific Languages (DSLs) remains a significant challenge, affecting millions of developers-3.5 million users in Rust alone-who cannot fully utilize LLM capabilities. LRPLs and DSLs encounter unique obstacles, including data scarcity and, for DSLs, specialized syntax that is poorly represented in general-purpose datasets. Addressing these challenges is crucial, as LRPLs and DSLs enhance development efficiency in specialized domains, such as finance and science. While several surveys discuss LLMs in software engineering, none focus specifically on the challenges and opportunities associated with LRPLs and DSLs. Our survey fills this gap by systematically reviewing the current state,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmbedded Systems Design Techniques · Parallel Computing and Optimization Techniques · Software Testing and Debugging Techniques
MethodsFocus
