Development of an LLM-Based System for Automatic Code Generation from HEP Publications
Masahiko Saito, Tomoe Kishimoto, Junichi Tanaka

TL;DR
This paper presents a prototype system using large language models to extract analysis procedures from high-energy physics publications and generate executable code for result reproduction, highlighting current capabilities and challenges.
Contribution
It introduces a two-stage LLM-based approach for automating HEP analysis reproduction, including extraction and code generation, with initial promising results and identified limitations.
Findings
LLMs can recover many documented selection criteria.
Generated event selections can match baseline implementations.
Challenges include stochasticity, hallucination, and execution failures.
Abstract
Ensuring the reproducibility of physics results is one of the crucial challenges in high-energy physics (HEP). In this study, we develop a proof-of-concept system that uses large language models (LLMs) to extract analysis procedures from HEP publications and generate executable analysis code for reproducing published results. Our method consists of two stages. In the first stage, open-weight LLMs extract event selection criteria, object definitions, and other relevant analysis information from a target paper and, when necessary, from its referenced publications, and then produce a structured selection list. In the second stage, the structured selection list is used to generate analysis code, which is then executed and validated iteratively. As a benchmark, we use the ATLAS analysis based on proton-proton collision data recorded in 2015 and 2016 and released…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
