Trees and Islands -- Machine learning approach to nuclear physics
Nishchal R. Dwivedi

TL;DR
This paper applies machine learning, specifically gradient boosted trees, to nuclear physics data to predict various nuclear properties, including those of superheavy elements, demonstrating the method's accuracy and potential for scientific insights.
Contribution
It introduces a data-driven machine learning approach using gradient boosted trees to predict complex nuclear physics parameters, including for superheavy elements.
Findings
Predicted nuclear properties with low standard deviation.
Successfully modeled level density parameters for superheavy elements.
Demonstrated machine learning's effectiveness in nuclear physics prediction tasks.
Abstract
We implement machine learning algorithms to nuclear data. These algorithms are purely data driven and generate models that are capable to capture intricate trends. Gradient boosted trees algorithm is employed to generate a trained model from existing nuclear data, which is used for prediction for data of damping parameter, shell correction energies, quadrupole deformation, pairing gaps, level densities and giant dipole resonance for large number of nuclei. We, in particular, predict level density parameter for superheavy elements which is of great current interest. The predictions made by the machine learning algorithm is found to have standard deviation from 0.00035 to 0.73.
| Element | Z | GC | STF | ||
|---|---|---|---|---|---|
| 104 | 27.44 | 9.73 | 19.85 | 13.45 | |
| 105 | 27.62 | 9.70 | 19.88 | 13.48 | |
| 106 | 27.67 | 9.72 | 19.98 | 13.46 | |
| 107 | 24.51 | 9.81 | 19.98 | 13.51 | |
| 108 | 27.10 | 11.20 | 19.98 | 13.51 | |
| 109 | 27.40 | 10.14 | 19.97 | 13.92 | |
| 110 | 27.44 | 10.21 | 19.97 | 14.07 | |
| 111 | 27.40 | 10.29 | 19.97 | 14.12 | |
| 112 | 27.44 | 10.39 | 19.94 | 14.29 | |
| 113 | 27.40 | 10.43 | 19.95 | 14.33 | |
| 114 | 27.44 | 10.53 | 19.93 | 14.50 | |
| 115 | 27.62 | 10.50 | 19.93 | 14.55 | |
| 116 | 27.44 | 10.67 | 19.92 | 14.70 | |
| 117 | 27.66 | 10.62 | 19.94 | 14.75 | |
| 118 | 27.55 | 10.67 | 19.94 | 14.75 | |
| 119 | 27.44 | 11.48 | 19.93 | 15.81 | |
| 120 | 27.55 | 10.85 | 20.22 | 14.79 | |
| 121 | 27.40 | 11.68 | 19.96 | 16.04 | |
| 122 | 27.43 | 11.15 | 21.26 | 14.39 |
| Observable | (2 features) | (7 features) |
|---|---|---|
| Damping parameter | 0.00170 | 0.00035 |
| Shell Correction Energy | 0.910 | 0.553 |
| deformation | 0.018 | 0.015 |
| Proton pairing gap | 0.040 | 0.037 |
| Neutron pairing gap | 0.640 | 0.037 |
| LDP (GC) | 0.790 | 0.730 |
| LDP (STF) | 0.600 | 0.300 |
| (3 features) | (8 features) |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNuclear physics research studies · Nuclear Physics and Applications · Nuclear reactor physics and engineering
Trees and Islands – Machine learning approach to nuclear physics
Nishchal R. Dwivedi
Nuclear Physics Division, Bhabha Atomic Research Centre, Trombay, Mumbai 400 085, India
and
Department of Physics, University of Mumbai, Vidyanagari Campus, Mumbai 400 098, India
Abstract
We implement machine learning algorithms to nuclear data. These algorithms are purely data driven and generate models that are capable to capture intricate trends. Gradient boosted trees algorithm is employed to generate a trained model from existing nuclear data, which is used for prediction for data of damping parameter, shell correction energies, quadrupole deformation, pairing gaps, level densities and giant dipole resonance for large number of nuclei. We, in particular, predict level density parameter for superheavy elements which is of great current interest. The predictions made by the machine learning algorithm is found to have standard deviation from 0.00035 to 0.73.
1 Introduction
Machine Learning (ML) and Artificial Intelligence (AI) have found a place in our contemporary technology to discover patterns, classify and predict any large data. These classifications and predictions find applications from social media, online behaviour[1, 2] to banking, stock market movement [3], and so on. These algorithms are used by various social platforms to personalise our experience on the web, by suggesting us news[4], advertisements [5] and pictures [6] of our interest. Recent use of ML in high energy physics for search of exotic particles [7], and making faster computations in molecular dynamics calculations [8] has opened the vistas of their usage in fundamental sciences.
Due to the capability of harnessing of nuclear technology in medicine, agriculture, and clean energy, there is a need to have a collective database of various nuclear observables for research and diagnostic purposes. Organisations like IAEA have been maintaining the database as a standard and have a collection of observed and evaluated data of nuclei since late 1960’s.
ML prediction algorithms do not just provide a regression curve, but are capable of capturing more complex patterns in the data. In this letter we explore such an algorithm for model generation and prediction of nuclear observables. We use the available nuclear data and apply the Gradient Boosted Trees [9] prediction algorithm. This algorithm is trained on randomly sampled 60% of the data, with a part of this 60% used for validation. This training leads to a prediction model, which is generated as a regression tree. GBT minimizes the loss function of mean square error, and modifies the original model by correcting on the errors iteratively, till either the error minimises to a constant or the model starts to overfit. Overfitting is avoided by constantly checking the accuracy of the measurement from the validation set. The rest of the 40% of data is used for testing the generated model. This ratio of 60%-40% is taken after optimisation and is found to work well with the considered observables. We define a standard deviation (), to quantify predictions, as the root mean square of difference between predicted values by the model and actual values given by the test data. Further, standard error is defined as the standard deviation divided by the square root of size of the sample.
Recent works [10, 11] use ML techniques to predict nuclear mass and charge radii using Support Vector Machines and Neural Networks. These works use two input parameters (features) of number of neutrons and number of protons. These techniques though give a good predictions for the above observables, but they do not predict with similar efficiency for other nuclear data. Further, the ML algorithms learn better with more number of relevant features. In our work, we do feature engineering for the problem and increase the number for features from two to seven and then generate a prediction model for various nuclear data. We further employ the ML algorithm for two features for the same set of data and compare the results in Table 2.
2 Classification and prediction
Consider a set of data where the input variables (features) give a resultant output through some process. The aim is to find a model , such that, , where . In a typical ML scenario, a training set of a given data is used to teach the algorithm patterns in the output ’s. Based on this learning, the ML algorithm generates a model. This model can be a simple linear regression, a collection of trees (Random Forest), based on Neural networks, a classification of clustering (based on Support Vector machine), or any other of such techniques, depending on the problem. This generated model can then be used to predict ’s for ’s of interest, for which data is not even available.
Machine learning algorithms help in both classifying and predicting types of problems. In classification problems, the output is a discrete number, indicating classes or limited number of possible states where the data can exist. Here, a classification algorithm after learning gives the most probable class in which the state with a given set of features can exist in.
In predicting problems, the features correspond to a continuous value of . Here, the algorithm has to learn through regression techniques to predict how the data trend will follow for the features of interest.
Gradient boosted trees (GBT) is a ML technique aimed at minimizing the loss function. Here, we demonstrate machine learning methods in nuclear physics using GBT.
3 Calculations and results
Modelling a nucleus and understanding its energy levels [12] has been of great interest. The famous liquid drop model was proposed by Gamow and for which Bethe and von Weizsäcker [13, 14] proposed a semi empirical mass formula to describe a nucleus. This formula is written in terms of volume, surface, Coulomb interaction, mass asymmetry and pairing energies. In our approach, we treat these along with the number of protons and neutrons as the features of the problem, as they are capable to model nuclear phenomena [15, 16]. After training the nuclear data, a model is generated which is used to give predictions on the test set, for which the actual values are known. If we plot the actual value vs the predicted values of the test set, the points should fall on the line, for perfect prediction.
3.1 Shell correction energy and damping parameter,
Statistical models have been very useful to understand nuclei and their transitions. Usually, Fermi gas model [17] is used to predict level density of the nucleus, but this model does not incorporate the shell effects. To incorporate shell corrections given by Strutinsky [18], phenomenological relations are used. One such popularly used relation is by Ignatyuk [19] as follows:
[TABLE]
where is the asymptotic value of nuclear level density parameter. The damping coefficient, denoted by , is obtained by fitting gamma spectrum [20] and denotes the shell correction energy which is given as the difference between the experimental binding energy of a nucleus and the binding energy calculated from the liquid drop model [21]. The Gilbert-Cameron model [22] calculates values by using neutron resonance data for various nuclei. This calculated data for 290 nuclei from 11 to 98 [23] were used for calculations. 6735 nuclei from to 99 [23] were used for prediction. These values of are calculated using Mengoni-Nakajima mass formula [24]. The prediction from this data for the damping coefficient is given by Fig. 1 and for shell correction energy is given by Fig. 2. The prediction for has the standard deviation as 0.00035, which shows an excellent prediction by the GBT method, and for is 0.553, which shows a good prediction.
3.2 deformation
Understanding the shape and structure of a nucleus gives us information about the energy levels and transitions among them [25]. Usually, the ground state deformation is calculated by using a finite – range liquid drop model [26]. These have been tabulated in the RIPL library at [23]. We use the quadrupole deformation values for 8983 nuclei from to 136 from this library. We see that the test set values show a good prediction from the generated model with the standard deviation of 0.015 as seen in Fig. 3.
3.3 Pairing gap
The possibility of existence of pairing in the nuclei was postulated by Bohr and others [27], which was a success in explaining the dependence of the binding energy of the nucleus with the even and odd number of protons and neutrons. The pairing gap calculated using BCS pairing model with energy levels obtained by the folded-Yukawa single-particle model [28] for 8979 nuclei from 8 to 136 [29] is used. The predictions for the test set for proton and neutron pairing gaps are shown in Fig. 4 and Fig. 5 respectively with their respective standard deviations as 0.037 and 0.023.
3.4 Level density parameter
Level density parameter (LDP) is the most important extracted quantity to understand nuclear observables like neutron resonances and reaction cross sections [30]. Recent semiclassical trace formula (STF) [31] evaluates the temperature- dependent LDP within 10%-15% of the experimental values for magic and semi magic nuclei. This trace formula models nucleus using Harmonic oscillator [32] with spin-orbit interactions [33]. This formalism also incorporates the shell effects in the nuclei [34].
The Gilbert-Cameron (GC) model fits the Fermi gas model with the neutron resonances to obtain asymptotic LDP. These LDP are tabulated in [23] for 289 nuclei from to 98.
The results from GC show (Fig. 6) a standard variation of 0.73 and a good agreement of the asymptotic level densities.
The STF is an exact formula with no adjustable parameters and this formalism also gives level density as a function of temperature. We train the algorithm for temperature dependent level density for 32 nuclei for various temperatures. We have 3000 data points. Temperature is added as a feature in this case along with the liquid drop energies and number of neutrons and protons. The ML generated model shows a standard deviation of 0.3 and the predicted values show a good agreement with the actual values (Fig. 7).
We further venture towards the island of stability and predict the LDP of superheavy elements.
3.5 Superheavy elements
The synthesis of superheavy elements (SHE) is a big challenge in nuclear physics and is of great contemporary interest due to its importance in heavy ion fusion reactions [35]. The variation of LDP with the number of nucleons is generally seen to have an trend, where is the mass number of the nucleus and is the positive real number. Calculations suggest that for SHE, is about 11-13 [36]. We use the ML models generated from GC and STF to predict the level densities of some super heavy elements as in Table 1 and calculate the respective - values as and . We find that the STF does better than GC as expected, due to its exact nature, more information on temperature dependence and incorporation of spin orbit interaction in the trace formula.
3.6 Giant Dipole Resonance
During a photonuclear reaction, signatures of the nuclei behaving like a giant dipole with collective oscillations [37] are observed. These oscillations are called as Giant Dipole Resonance (GDR). It corresponds to the fundamental absorption frequency of electric dipole radiation of the nucleus acting as a whole. It is often simply understood as oscillations in the nucleus of the neutrons against the protons. GDR dominates the photo-absorption process at 10-30 MeV of energies [38].
We take the experimentally observed values for 180 nuclei for their observed first peak of energy corresponding to GDR [23]. We then train our model and predict for a randomly selected test set. These experimental values give predictions with standard deviation of 0.76. The experimental standard deviation of these values from various experiments lie between 0.01 to 11.74.
4 Conclusion
Nuclear data has been studied extensively using a machine learning algorithm. Phenomenological quantities like damping factor and shell correction energies, quantities with quantum mechanical origin like pairing gaps, evaluated values of quadrupole deformations, statistical quantities of level densities and experimental quantities of first peak energy of giant dipole resonances have been modelled. The predictions from these models show a very good agreement with the actual values with standard deviations ranging from 0.00035 to 0.73. To compare with contemporary literature, similar investigations on time series analysis [39], molecular physics [40], astrophysics [41], economics [42, 43] and climate studies [44] report standard deviations ranging between 0.02 to 75.0. Our results, thus, are well acceptable, particularly as they are better by two orders of magnitude for damping parameter, . We use these predictions to calculate the level density parameter of the superheavy elements in the island of stability, the values obtained sharpen the expected values from proposed theories. We also show how the engineered features increase the accuracy of the predictions by the generated models.
These algorithms are based and derived only from the data and not from nuclear theoretical modelling or phenomenological analysis of the experiments. Though the theoretical basis of these algorithms are statistical in nature, they capture the patterns in the data very well. This is somewhat reminiscent of the success of ideas originating in random matrix theory which work very well in nuclear physics, quantum chaos and so on [45]. We hope that this work will pave the way for further investigations using machine learning and AI as tools in the fields of physics where a lot of data is available while a clear cut theory is not yet available.
Alternatively, the limited number of features in the problem and good predictions, makes ML a good tool to get an insight for global mathematical modelling of nuclei, which is valid for all ranges of mass numbers. ML can be instrumental in understanding nuclear properties for which the experiments are not yet feasible.
5 Acknowledgements
The author expresses his deep gratitude to Prof. Sudhir Ranjan Jain (BARC) for discussions which led to this problem. The author is grateful to Aayushi Gupta (Indira Gandhi Institute of Development Research, Mumbai), Raman Sehgal (BARC) and Dr. Kaushal Sharma (Inter-University Centre for Astronomy and Astrophysics, Pune) for involved discussions.
The author acknowledges gratefully the funding for research from the Department of Atomic Energy and University of Mumbai - Bhabha Atomic Research Centre collaboration.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Kyumin Lee, James Caverlee, and Steve Webb. Uncovering social spammers: social honeypots+ machine learning. In Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval , pages 435–442. ACM, 2010.
- 2[2] Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining , pages 701–710. ACM, 2014.
- 3[3] Wei Huang, Yoshiteru Nakamori, and Shou-Yang Wang. Forecasting stock market movement direction with support vector machine. Computers & Operations Research , 32(10):2513–2522, 2005.
- 4[4] Jiahui Liu, Peter Dolan, and Elin Rønby Pedersen. Personalized news recommendation based on click behavior. In Proceedings of the 15th international conference on Intelligent user interfaces , pages 31–40. ACM, 2010.
- 5[5] Suman Nath, Felix Xiaozhu Lin, Lenin Ravindranath, and Jitendra Padhye. Smartads: bringing contextual ads to mobile apps. In Proceeding of the 11th annual international conference on Mobile systems, applications, and services , pages 111–124. ACM, 2013.
- 6[6] Luc Steels. Collaborative tagging as distributed cognition. Pragmatics & cognition , 14(2):287–292, 2006.
- 7[7] Pierre Baldi, Peter Sadowski, and Daniel Whiteson. Searching for exotic particles in high-energy physics with deep learning. Nature communications , 5:4308, 2014.
- 8[8] Venkatesh Botu and Rampi Ramprasad. Adaptive machine learning framework to accelerate ab initio molecular dynamics. International Journal of Quantum Chemistry , 115(16):1074–1083, 2015.
