Leveraging Large Language Models for Efficient Failure Analysis in Game Development
Leonardo Marini, Linus Gissl\'en, and Alessandro Sestini

TL;DR
This paper introduces a novel method using Large Language Models to automatically identify code changes responsible for test failures in game development, significantly reducing debugging time and improving bug detection accuracy.
Contribution
The paper presents a new LLM-based approach for pinpointing failure-inducing code changes, demonstrating its effectiveness in a large-scale game development environment.
Findings
Achieved 71% accuracy in identifying failure causes.
Reduced debugging time by up to 60%.
Validated effectiveness through quantitative and user studies.
Abstract
In games, and more generally in the field of software development, early detection of bugs is vital to maintain a high quality of the final product. Automated tests are a powerful tool that can catch a problem earlier in development by executing periodically. As an example, when new code is submitted to the code base, a new automated test verifies these changes. However, identifying the specific change responsible for a test failure becomes harder when dealing with batches of changes -- especially in the case of a large-scale project such as a AAA game, where thousands of people contribute to a single code base. This paper proposes a new approach to automatically identify which change in the code caused a test to fail. The method leverages Large Language Models (LLMs) to associate error messages with the corresponding code changes causing the failure. We investigate the effectiveness of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Software Testing and Debugging Techniques
