Prompt engineering and framework: implementation to increase code reliability based guideline for LLMs

Rogelio Cruz; Jonatan Contreras; Francisco Guerrero; Ezequiel Rodriguez; Carlos Valdez; Citlali Carrillo

arXiv:2506.10989·cs.SE·June 16, 2025

Prompt engineering and framework: implementation to increase code reliability based guideline for LLMs

Rogelio Cruz, Jonatan Contreras, Francisco Guerrero, Ezequiel Rodriguez, Carlos Valdez, Citlali Carrillo

PDF

Open Access

TL;DR

This paper presents a novel prompt engineering framework that significantly improves the accuracy and efficiency of Python code generation by LLMs, outperforming existing methods in quality and resource usage.

Contribution

The paper introduces a new prompt template that enhances code correctness and reduces token consumption, advancing the state-of-the-art in LLM-based code generation.

Findings

01

Outperforms zero-shot and CoT methods in Pass@k metric

02

Reduces token usage significantly compared to CoT

03

Enhances code reliability and resource efficiency

Abstract

In this paper, we propose a novel prompting approach aimed at enhancing the ability of Large Language Models (LLMs) to generate accurate Python code. Specifically, we introduce a prompt template designed to improve the quality and correctness of generated code snippets, enabling them to pass tests and produce reliable results. Through experiments conducted on two state-of-the-art LLMs using the HumanEval dataset, we demonstrate that our approach outperforms widely studied zero-shot and Chain-of-Thought (CoT) methods in terms of the Pass@k metric. Furthermore, our method achieves these improvements with significantly reduced token usage compared to the CoT approach, making it both effective and resource-efficient, thereby lowering the computational demands and improving the eco-footprint of LLM capabilities. These findings highlight the potential of tailored prompting strategies to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware System Performance and Reliability · Software Reliability and Analysis Research · Software Testing and Debugging Techniques