# Explainable machine learning for incipient anomaly detection in compact molten salt heat exchanger with overlapping feature distributions

**Authors:** Konstantinos Prantikos, Taeseung Lee, Thanh Q. Hua, Lefteri H. Tsoukalas, Alexander Heifetz

PMC · DOI: 10.1038/s41598-025-27112-8 · 2026-03-06

## TL;DR

This paper introduces a compact heat exchanger design with fiber optic sensors and explainable machine learning to detect early faults in molten salt reactors.

## Contribution

A novel explainable ML framework combining Shapley values and POSETs for incipient anomaly detection in overlapping feature distributions.

## Key findings

- XGBoost outperformed other ML models in detecting early-stage faults in the heat exchanger.
- The explainability framework revealed dominant predictors and ambiguous feature relationships in the dataset.
- The proposed design enables localized fault detection with high-fidelity synthetic data.

## Abstract

High-temperature molten salt-cooled reactors (MSCRs) are a promising next-generation nuclear technology option, offering efficient power conversion and inherent safety features. However, the reliability of these systems depends on the robust operation of heat exchangers (HXs), which are susceptible to failure due to temperature gradients and channel plugging caused by fluid freezing. Conventional monitoring methods, relying on inlet and outlet measurements, lack the spatial resolution needed to detect early-stage faults. We propose a novel design of a compact salt-to-salt matrix-type HX design consisting of interleaved arrays of parallel tubes, with integrated synthetic fiber optic distributed temperature sensing (DTS) to enable localized detection of incipient faults. To evaluate performance of this design, we generate high-fidelity synthetic data using heat transfer computational modeling to simulate channel plugging, and introduce sensor noise for realistic modeling of measurements. The dataset comprises of 97% normal operation and 3% anomaly cases, with each anomaly class representing 1% of the data. These early anomalies result in overlapping temperature profiles between normal and faulty channels, producing a non-separable dataset that challenges traditional classification techniques. We benchmark eight supervised machine learning (ML) models and demonstrate that XGBoost achieves the highest performance. To improve transparency, we develop an explainability framework combining Shapley values and partially ordered sets (POSETs) to quantify and structurally analyze feature importance. This approach identifies both dominant predictors and ambiguous feature relationships, enhancing trust and interpretability. Our results highlight the potential of combining DTS and explainable ML with intelligent feature selection to improve predictive maintenance and ensure operational resilience in advanced nuclear systems.

## Full-text entities

- **Genes:** FBXL15 (F-box and leucine rich repeat protein 15) [NCBI Gene 79176] {aka FBXO37, Fbl15, JET}, HPX (hemopexin) [NCBI Gene 3263] {aka HX}, ITIH3 (inter-alpha-trypsin inhibitor heavy chain 3) [NCBI Gene 3699] {aka H3P, ITI-HC3, SHAP}
- **Diseases:** POSETs (MESH:D020920), ML (MESH:D007859), nuclear anomaly (MESH:D010381), FN (MESH:D017541)
- **Chemicals:** Water (MESH:D014867), lithium fluoride (MESH:C027651), salt (MESH:D012492), silica (MESH:D012822), sodium (MESH:D012964), DFOS (-), MgCl2 (MESH:D015636), NaCl (MESH:D012965)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12966489/full.md

---
Source: https://tomesphere.com/paper/PMC12966489