Augmenting Large Language Models with Static Code Analysis for Automated Code Quality Improvements
Seyed Moein Abtahi, Akramul Azim

TL;DR
This paper presents a novel approach combining static code analysis, large language models, and retrieval-augmented generation to automate code issue detection and correction, significantly improving code quality and development efficiency.
Contribution
It introduces an integrated framework that leverages static analysis, LLMs, and RAG to automate code revisions and address hallucinations, advancing automated software maintenance.
Findings
Significant reduction in code issues after applying the framework
Enhanced accuracy of code revisions through retrieval-augmented generation
Effective mitigation of LLM hallucinations with a custom comparison tool
Abstract
This study examined code issue detection and revision automation by integrating Large Language Models (LLMs) such as OpenAI's GPT-3.5 Turbo and GPT-4o into software development workflows. A static code analysis framework detects issues such as bugs, vulnerabilities, and code smells within a large-scale software project. Detailed information on each issue was extracted and organized to facilitate automated code revision using LLMs. An iterative prompt engineering process is applied to ensure that prompts are structured to produce accurate and organized outputs aligned with the project requirements. Retrieval-augmented generation (RAG) is implemented to enhance the relevance and precision of the revisions, enabling LLM to access and integrate real-time external knowledge. The issue of LLM hallucinations - where the model generates plausible but incorrect outputs - is addressed by a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Scientific Computing and Data Management · Software Testing and Debugging Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Cosine Annealing · {Dispute@FaQ-s}How to file a dispute with Expedia? · Layer Normalization · Linear Warmup With Linear Decay · Linear Warmup With Cosine Annealing · Attention Dropout · Byte Pair Encoding · Softmax · Linear Layer
