On Iterative Evaluation and Enhancement of Code Quality Using GPT-4o
Rundong Liu, Andre Frade, Amal Vaidya, Maxime Labonne, Marcus Kaiser,, Bismayan Chakrabarti, Jonathan Budd, Sean Moran

TL;DR
CodeQUEST is a framework that uses GPT-4o to iteratively evaluate and improve code quality across multiple dimensions, demonstrating significant enhancements and close alignment with established metrics.
Contribution
Introduces CodeQUEST, a novel LLM-based framework for iterative code evaluation and enhancement across multiple quality dimensions, validated with experiments on Python and JavaScript.
Findings
Achieved a 52.6% mean relative improvement in code quality.
Evaluations closely align with established metrics like Pylint and Radon.
Demonstrated robustness and effectiveness in automated code quality enhancement.
Abstract
This paper introduces CodeQUEST, a novel framework leveraging Large Language Models (LLMs) to iteratively evaluate and enhance code quality across multiple dimensions, including readability, maintainability, efficiency, and security. The framework is divided into two main components: an Evaluator that assesses code quality across ten dimensions, providing both quantitative scores and qualitative summaries, and an Optimizer that iteratively improves the code based on the Evaluator's feedback. Our study demonstrates that CodeQUEST can effectively and robustly evaluate code quality, with its assessments aligning closely with established code quality metrics. Through a series of experiments using a curated dataset of Python and JavaScript examples, CodeQUEST demonstrated significant improvements in code quality, achieving a mean relative percentage improvement of 52.6%. The framework's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Numerical Methods and Algorithms
MethodsSparse Evolutionary Training
