XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion
Xiao Wang, Qingquan Yang, Fuling Wang, Qiang Chen, Wentao Wu, Yu Jin, Jingtao Jiang, Liye Jin, Bo Jiang, Dengdi Sun, Wanli Lv, Meiwen Chen, Zehua Chen, Guosheng Xu, Jin Tang

TL;DR
XiHeFusion is a large language model specifically trained on nuclear fusion knowledge to improve science communication and public understanding of nuclear fusion, leveraging fine-tuning and reasoning enhancement techniques.
Contribution
This paper introduces the first large language model dedicated to nuclear fusion, trained on multi-source data and enhanced with logical reasoning capabilities.
Findings
XiHeFusion performs well in answering science popularization questions.
The model demonstrates improved logical reasoning in nuclear fusion topics.
The pre-trained model is publicly available for further research.
Abstract
Nuclear fusion is one of the most promising ways for humans to obtain infinite energy. With the rapid development of artificial intelligence, the mission of nuclear fusion has also entered a critical period of its development. Helping more people understand nuclear fusion and join its research is one effective way to accelerate the realization of fusion. This paper proposes the first large model in the field of nuclear fusion, XiHeFusion, which is obtained through supervised fine-tuning of the open-source large model Qwen2.5-14B. We collected multi-source knowledge about nuclear fusion tasks to support the training of this model, including Common Crawl, eBooks, arXiv, dissertations, etc. After the model has mastered the knowledge of the nuclear fusion field, we further used chain of thought to enhance its logical reasoning ability,…
Taxonomy
Topics: Topic Modeling · Speech Recognition and Synthesis · Data Quality and Management
