Bridging LLM-Generated Code and Requirements: Reverse Generation   technique and SBC Metric for Developer Insights

Ahilan Ayyachamy Nadar Ponnusamy

arXiv:2502.07835·cs.SE·February 13, 2025·2 cites

Bridging LLM-Generated Code and Requirements: Reverse Generation technique and SBC Metric for Developer Insights

Ahilan Ayyachamy Nadar Ponnusamy

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel SBC scoring method that uses reverse generation to evaluate AI-generated code by reconstructing requirements and comparing them with original specifications, improving developer insights.

Contribution

Introduces the SBC score, a new evaluation metric combining semantic similarity, BLEU, and completeness, for better assessment of LLM-generated code quality.

Findings

01

SBC score correlates better with human judgment than traditional metrics.

02

Reverse generation effectively identifies missing features and hallucinations.

03

The approach enhances developer understanding of AI-generated code.

Abstract

The rise of Large Language Models (LLMs) in software engineering, particularly in code generation, has garnered significant attention. However, assessing the quality of AI-generated code remains a challenge due to the inherent complexity of programming tasks and the lack of robust evaluation metrics that align well with human judgment. Traditional token-based metrics such as BLEU and ROUGE, while commonly used in natural language processing, exhibit weak correlations with human assessments in code intelligence and verification tasks. Furthermore, these metrics are primarily research focused and are not designed for seamless integration into the software development lifecycle, limiting their practical utility for developers seeking to improve code quality and security. AI-assisted coding has been shown to be more beneficial for senior developers, as they possess the expertise to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AhilanPonnusamy/Reverse-Generation-and-SBC-Metric--Reference-Implementation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law

MethodsALIGN