CGEMs: A Metric Model for Automatic Code Generation using GPT-3
Aishwarya Narasimhan (1), Krishna Prasad Agara Venkatesha Rao (2),, Veena M B (1) ((1) B M S College of Engineering, (2) Sony India Software, Centre Pvt. Ltd.)

TL;DR
This paper proposes CGEMs, a metric model for evaluating AI-generated code quality using GPT-3, combining static and NLP metrics, and employs a neural network for classification with interpretability features.
Contribution
It introduces a novel metric model for assessing AI-generated code quality and demonstrates its effectiveness with a neural network classifier.
Findings
Achieved 76.92% classification accuracy.
F1 score of 55.56% for code quality assessment.
Metrics effectively distinguish acceptable from unacceptable code.
Abstract
Today, AI technology is showing its strengths in almost every industry and walks of life. From text generation, text summarization, chatbots, NLP is being used widely. One such paradigm is automatic code generation. An AI could be generating anything; hence the output space is unconstrained. A self-driving car is driven for 100 million miles to validate its safety, but tests cannot be written to monitor and cover an unconstrained space. One of the solutions to validate AI-generated content is to constrain the problem and convert it from abstract to realistic, and this can be accomplished by either validating the unconstrained algorithm using theoretical proofs or by using Monte-Carlo simulation methods. In this case, we use the latter approach to test/validate a statistically significant number of samples. This hypothesis of validating the AI-generated code is the main motive of this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Topic Modeling
MethodsAttention Is All You Need · Linear Layer · Cosine Annealing · Layer Normalization · Linear Warmup With Cosine Annealing · Adam · {Dispute@FaQ-s}How to file a dispute with Expedia? · Refunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Weight Decay
