Do Advanced Language Models Eliminate the Need for Prompt Engineering in Software Engineering?
Guoqing Wang, Zeyu Sun, Zhihao Gong, Sixiang Ye, Yizhou Chen, Yifan Zhao, Qingyuan Liang, Dan Hao

TL;DR
This study empirically evaluates whether advanced LLMs eliminate the need for prompt engineering in software engineering tasks, finding that simpler prompts often suffice and reasoning models excel mainly in complex tasks.
Contribution
It provides a comprehensive empirical analysis of prompt engineering effectiveness across advanced LLMs in software engineering, highlighting when complex prompts are beneficial or unnecessary.
Findings
Prompt engineering benefits diminish with advanced models.
Reasoning models excel in complex tasks but add little in simple ones.
Simple zero-shot prompts can outperform complex engineered prompts.
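To make the contrast in these findings concrete, here is a minimal sketch of the two prompt styles being compared: a plain zero-shot instruction and a more heavily engineered prompt with a role, few-shot examples, and a step-by-step cue. The function names, prompt wording, and example task are illustrative assumptions, not the templates used in the paper, and no model is called.

```python
# Illustrative only: hypothetical prompt builders, not the paper's templates.

def zero_shot_prompt(task: str) -> str:
    """Build a plain instruction with no examples or reasoning scaffold."""
    return f"Complete the following task.\n\nTask: {task}\n"


def engineered_prompt(task: str, examples: list[tuple[str, str]]) -> str:
    """Build a prompt with a role, few-shot examples, and a reasoning cue."""
    parts = ["You are an expert software engineer."]
    for i, (example_task, solution) in enumerate(examples, start=1):
        parts.append(f"Example {i}:\nTask: {example_task}\nSolution:\n{solution}")
    parts.append(f"Task: {task}")
    parts.append("Think step by step, then give the final code.")
    return "\n\n".join(parts) + "\n"


if __name__ == "__main__":
    task = "Write a Python function that reverses a string."
    shots = [("Write a Python function that adds two numbers.",
              "def add(a, b):\n    return a + b")]
    print(zero_shot_prompt(task))
    print(engineered_prompt(task, shots))
```

The engineered variant is longer and costs more tokens per call, so it is only worth using where it measurably improves results for the model at hand.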
Abstract
Large Language Models (LLMs) have significantly advanced software engineering (SE) tasks, with prompt engineering techniques enhancing their performance in code-related areas. However, the rapid development of foundational LLMs such as the non-reasoning model GPT-4o and the reasoning model o1 raises questions about the continued effectiveness of these prompt engineering techniques. This paper presents an extensive empirical study that reevaluates various prompt engineering techniques within the context of these advanced LLMs. Focusing on three representative SE tasks, i.e., code generation, code translation, and code summarization, we assess whether prompt engineering techniques still yield improvements with advanced models, the actual effectiveness of reasoning models compared to non-reasoning models, and whether the benefits of using these advanced models justify their increased…
Taxonomy
Topics: Software Engineering Techniques and Practices · Software Engineering Research · Software Reliability and Analysis Research
