# Machine Learning-Enhanced MALDI-TOF Mass Spectrometry for Screening HBsAg-Positive Patients

**Authors:** Tiantian Zhang, Shixuan Huang, Junxun Li, Yuwei Wu, Xinyu Zhao, He Gao, Juan Yang, Lingshuang Yang, Lulu Cao, Xinqiang Xie, Hui Zhao, Jing Cheng, Hongxia Tan, Ying Li, Qingping Wu

PMC · DOI: 10.3390/microorganisms14030702 · Microorganisms · 2026-03-20

## TL;DR

This study combines MALDI-TOF mass spectrometry with machine learning to rapidly and affordably screen for hepatitis B surface antigen in large populations.

## Contribution

A novel machine learning-enhanced MALDI-TOF MS method for HBsAg screening with low cost and high throughput is proposed.

## Key findings

- The LightGBM model achieved an AUC of 0.94 and an F1 score of 0.87 for HBsAg screening.
- Twelve stable m/z peaks were identified as potential biomarkers for HBsAg-positive status.
- The method offers a detection time of ~1 minute and a per-sample cost of ~$0.14.

## Abstract

Hepatitis B virus (HBV) remains a major global public health challenge, and its early screening is essential for controlling transmission and improving treatment outcomes. We analyzed serum samples from 422 participants via Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to establish a screening model for hepatitis B surface antigen (HBsAg)-positive status. Following multi-bin preprocessing and single-sample spectral aggregation, we assessed three machine learning algorithms—random forest, deep neural network, and light gradient boosting machine (LightGBM). Among them, the LightGBM model achieved the best performance, with an optimized F1 score of 0.87 and an area under the receiver operating characteristic curve (AUC) of 0.94. A 100-iteration ensemble feature stabilization strategy identified twelve distinct m/z peaks as stable biomarkers for HBsAg-positive screening. Independent validation yielded sensitivity of 77.7% and specificity of 76.0%—insufficient for individual diagnosis but potentially suitable for population-level surveillance programs combined with confirmatory testing, particularly in resource-limited settings where conventional methods are impractical. Notably, the method offers a detection time of approximately one minute, a per-sample cost of ~$0.14. In conclusion, the combination of MALDI-TOF MS and machine learning enables a rapid, low-cost screening tool for large-scale HBV detection.

## Full-text entities

- **Species:** Hepatitis B virus (no rank) [taxon 10407], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13028653/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13028653/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC13028653/full.md

---
Source: https://tomesphere.com/paper/PMC13028653