Neurons for Neutrons: A Transformer Model for Computation Load Estimation on Domain-Decomposed Neutron Transport Problems

Alexander Mote; Todd Palmer; Lizhong Chen

arXiv:2411.03389·physics.comp-ph·August 18, 2025

Neurons for Neutrons: A Transformer Model for Computation Load Estimation on Domain-Decomposed Neutron Transport Problems

Alexander Mote, Todd Palmer, Lizhong Chen

PDF

Open Access

TL;DR

This paper introduces a Transformer-based model that accurately predicts computation loads in domain-decomposed neutron transport problems, reducing the need for time-consuming simulations and enabling faster, scalable analysis.

Contribution

The paper presents a novel Transformer model with 3D embeddings tailored for neutron transport problems, achieving high accuracy and eliminating the need for repeated small-scale simulations.

Findings

01

Achieves 98.2% accuracy in load prediction

02

Can bypass small-scale simulations entirely

03

Demonstrates robustness across different geometries and parameters

Abstract

Domain decomposition is a technique used to reduce memory overhead on large neutron transport problems. Currently, the optimal load-balanced processor allocation for these domains is typically determined through small-scale simulations of the problem, which can be time-consuming for researchers and must be repeated anytime a problem input is changed. We propose a Transformer model with a unique 3D input embedding, and input representations designed for domain-decomposed neutron transport problems, which can predict the subdomain computation loads generated by small-scale simulations. We demonstrate that such a model trained on domain-decomposed Small Modular Reactor (SMR) simulations achieves 98.2% accuracy while being able to skip the small-scale simulation step entirely. Tests of the model's robustness on variant fuel assemblies, other problem geometries, and changes in simulation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNuclear reactor physics and engineering · Nuclear Physics and Applications

MethodsLinear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Attention Is All You Need · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout · Absolute Position Encodings