ChatQCD: Let Large Language Models Explore QCD
Antonin Sulc, Patrick L.S. Connor

TL;DR
This paper introduces ChatQCD, a large language model designed to explore and generate scientific texts related to quantum chromodynamics, aiming to accelerate research and knowledge consolidation in high-energy physics.
Contribution
It constructs a specialized corpus and develops a generative model for QCD, pioneering efforts to automate scientific knowledge synthesis in this domain.
Findings
Built a QCD-specific scientific corpus
Developed a generative model for QCD texts
Discussed future challenges in LLM applications for scientific research
Abstract
Quantum chromodynamics (QCD) has yielded a vast literature spanning distinct phenomena. We construct a corpus of papers and build a generative model. This model holds promise for accelerating the capability of scientists to consolidate their knowledge of QCD by the ability to generate and validate scientific works in the landscape of works related to QCD and similar problems in HEP. Furthermore, we discuss challenges and future directions of using large language models to integrate our scientific knowledge about QCD through the automated generation of explanatory scientific texts.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Computational Physics and Python Applications · Machine Learning in Healthcare
