Prompting and Fine-tuning Large Language Models for Automated Code   Review Comment Generation

Md. Asif Haider; Ayesha Binte Mostofa; Sk. Sabit Bin Mosaddek; Anindya; Iqbal; Toufique Ahmed

arXiv:2411.10129·cs.SE·November 18, 2024

Prompting and Fine-tuning Large Language Models for Automated Code Review Comment Generation

Md. Asif Haider, Ayesha Binte Mostofa, Sk. Sabit Bin Mosaddek, Anindya, Iqbal, Toufique Ahmed

PDF

Open Access

TL;DR

This paper explores fine-tuning open-source large language models with parameter-efficient methods and augmenting prompts with semantic metadata to enhance automated code review comment generation, achieving significant performance improvements.

Contribution

It introduces a low-rank quantization fine-tuning approach for open-source LLMs and demonstrates the effectiveness of semantic metadata augmentation in prompting for code review tasks.

Findings

01

Function call graph augmentation with GPT-3.5 improves BLEU-4 scores by 90%.

02

Few-shot prompting of proprietary LLMs achieves 25-83% performance gains.

03

Human evaluation confirms improved quality of generated review comments.

Abstract

Generating accurate code review comments remains a significant challenge due to the inherently diverse and non-unique nature of the task output. Large language models pretrained on both programming and natural language data tend to perform well in code-oriented tasks. However, large-scale pretraining is not always feasible due to its environmental impact and project-specific generalizability issues. In this work, first we fine-tune open-source Large language models (LLM) in parameter-efficient, quantized low-rank (QLoRA) fashion on consumer-grade hardware to improve review comment generation. Recent studies demonstrate the efficacy of augmenting semantic metadata information into prompts to boost performance in other code-related tasks. To explore this in code review activities, we also prompt proprietary, closed-source LLMs augmenting the input code patch with function call graphs and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Topic Modeling · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Attention Dropout · Dense Connections · Softmax · Layer Normalization · Dropout · Cosine Annealing