Hotfixing Large Language Models for Code

Zhou Yang; David Lo

arXiv:2408.05727·cs.SE·November 7, 2024

Hotfixing Large Language Models for Code

Zhou Yang, David Lo

PDF

Open Access

TL;DR

This paper introduces a hotfixing approach for Large Language Models for Code (LLM4Code) to reduce buggy code generation efficiently without retraining from scratch, using targeted fine-tuning techniques.

Contribution

It proposes a novel hotfixing method with specific loss functions and fine-tuning strategies to mitigate undesired behaviors in LLM4Code models effectively.

Findings

01

Hotfixing increases fixed code generation by up to 108.42%.

02

Buggy code generation decreases by up to 50.47%.

03

Model correctness remains unaffected on HumanEval.

Abstract

Large Language Models for Code (LLM4Code) have become an integral part of developers' workflows, assisting with tasks such as code completion and generation. However, these models are found to exhibit undesired behaviors after their release, like generating buggy code, due to their extensive training on vast amounts of source code that contain such buggy code. The training data (usually coming from open-source software) keeps evolving, e.g., developers fix the buggy code. However, adapting such evolution to mitigate LLM4Code's undesired behaviors is non-trivial, as retraining models on the updated dataset usually takes much time and resources. This motivates us to propose the concept of hotfixing LLM4Code, mitigating LLM4Code's undesired behaviors effectively and efficiently with minimal negative effects. This paper mainly focuses on hotfixing LLM4Code to make them generate less buggy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques