An empirical study of ChatGPT-3.5 on question answering and code maintenance
Md Mahir Asef Kabir, Sk Adnan Hassan, Xiaoyin Wang, Ying Wang, Hai Yu,, Na Meng

TL;DR
This empirical study compares ChatGPT-3.5 with human programmers in question answering and code maintenance, revealing ChatGPT's strong performance and potential impact on the software industry.
Contribution
It systematically evaluates ChatGPT's capabilities against programmers in answering technical questions and revising code, providing new insights into its practical effectiveness.
Findings
ChatGPT provided better answers for 97 out of 130 questions.
Developers preferred ChatGPT answers in 203 of 300 ratings.
ChatGPT correctly revised code for 22 of 48 maintenance tasks.
Abstract
Ever since the launch of ChatGPT in 2022, a rising concern is whether ChatGPT will replace programmers and kill jobs. Motivated by this widespread concern, we conducted an empirical study to systematically compare ChatGPT against programmers in question-answering and software-maintaining. We reused a dataset introduced by prior work, which includes 130 StackOverflow (SO) discussion threads referred to by the Java developers of 357 GitHub projects. We mainly investigated three research questions (RQs). First, how does ChatGPT compare with programmers when answering technical questions? Second, how do developers perceive the differences between ChatGPT's answers and SO answers? Third, how does ChatGPT compare with humans when revising code for maintenance requests? For RQ1, we provided the 130 SO questions to ChatGPT, and manually compared ChatGPT answers with the accepted/most popular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Software Engineering Research · Online Learning and Analytics
