Leveraging Large Language Models in Code Question Answering: Baselines   and Issues

Georgy Andryushchenko; Vladimir Ivanov; Vladimir Makharev; Elizaveta; Tukhtina; Aidar Valeev

arXiv:2411.03012·cs.CL·November 6, 2024

Leveraging Large Language Models in Code Question Answering: Baselines and Issues

Georgy Andryushchenko, Vladimir Ivanov, Vladimir Makharev, Elizaveta, Tukhtina, Aidar Valeev

PDF

Open Access 1 Repo

TL;DR

This paper explores fine-tuning large language models for Python code question answering, analyzing various data preprocessing techniques and highlighting dataset quality issues affecting performance.

Contribution

It introduces a fine-tuning approach for LLMs on Python code QA datasets and evaluates the impact of data preprocessing methods on answer quality.

Findings

01

Grammar correction improves testing metrics.

02

Public datasets have poor quality affecting results.

03

Manual error analysis reveals common answer errors.

Abstract

Question answering over source code provides software engineers and project managers with helpful information about the implemented features of a software product. This paper presents a work devoted to using large language models for question answering over source code in Python. The proposed method for implementing a source code question answering system involves fine-tuning a large language model on a unified dataset of questions and answers for Python code. To achieve the highest quality answers, we tested various models trained on datasets preprocessed in different ways: a dataset without grammar correction, a dataset with grammar correction, and a dataset augmented with the generated summaries. The model answers were also analyzed for errors manually. We report BLEU-4, BERTScore F1, BLEURT, and Exact Match metric values, along with the conclusions from the manual error analysis.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

iu-aes-ai4code/codequestionanswering
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems