Hotfixing Large Language Models for Code
Zhou Yang, David Lo

TL;DR
This paper introduces a hotfixing approach for Large Language Models for Code (LLM4Code) to reduce buggy code generation efficiently without retraining from scratch, using targeted fine-tuning techniques.
Contribution
It proposes a novel hotfixing method with specific loss functions and fine-tuning strategies to mitigate undesired behaviors in LLM4Code models effectively.
Findings
Hotfixing increases fixed code generation by up to 108.42%.
Buggy code generation decreases by up to 50.47%.
Model correctness remains unaffected on HumanEval.
Abstract
Large Language Models for Code (LLM4Code) have become an integral part of developers' workflows, assisting with tasks such as code completion and generation. However, these models are found to exhibit undesired behaviors after their release, like generating buggy code, due to their extensive training on vast amounts of source code that contain such buggy code. The training data (usually coming from open-source software) keeps evolving, e.g., developers fix the buggy code. However, adapting such evolution to mitigate LLM4Code's undesired behaviors is non-trivial, as retraining models on the updated dataset usually takes much time and resources. This motivates us to propose the concept of hotfixing LLM4Code, mitigating LLM4Code's undesired behaviors effectively and efficiently with minimal negative effects. This paper mainly focuses on hotfixing LLM4Code to make them generate less buggy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
