# A machine learning model for predicting oligoclonal band positivity using routine cerebrospinal fluid and serum biochemical markers

**Authors:** Hazar Gözgöz, Oğuzhan Orhan, Başak Akan Konuk, Pınar Akan

PMC · DOI: 10.1093/ajcp/aqaf119 · American Journal of Clinical Pathology · 2025-12-06

## TL;DR

This study creates a machine learning model that predicts OCB positivity using routine lab data, improving diagnostic accuracy for intrathecal IgG synthesis.

## Contribution

A novel machine learning ensemble model that outperforms the conventional IgG index in predicting OCB positivity.

## Key findings

- The model achieved an ROC-AUC of 0.902 on the internal test set, outperforming the IgG index (ROC-AUC 0.795).
- It demonstrated 90% accuracy and 96% sensitivity on the external validation cohort.
- The model combines optimized CatBoost, XGBoost, and LightGBM classifiers for improved performance.

## Abstract

To develop and validate a machine learning model for predicting oligoclonal band (OCB) positivity using routine cerebrospinal fluid (CSF) and serum biochemical markers to improve the diagnostic accuracy and efficiency of assessing intrathecal immunoglobulin G (IgG) synthesis.

In this retrospective study (n = 1709), an ensemble model was developed using 8 refined CSF and serum parameters. Combining optimized CatBoost, XGBoost, and LightGBM classifiers, the model was trained and evaluated using a 2-phase workflow, including 5-fold cross-validation and validation on independent internal (n = 342) and external (n = 49) cohorts.

The developed ensemble model achieved a receiver operating characteristic–area under the curve (ROC-AUC) of 0.902 on the internal test set, significantly outperforming the conventional IgG index (ROC-AUC, 0.795). At its optimal threshold, the model demonstrated an accuracy of 0.830, with a sensitivity of 0.714 and a specificity of 0.916. On the external validation cohort, it achieved 90% accuracy and 96% sensitivity.

A novel machine learning ensemble model accurately predicts OCB positivity using routine laboratory data and demonstrates superior performance compared with the IgG index. This approach represents a significant step in applying artificial intelligence in laboratory medicine, with the potential to enhance diagnostic efficiency. Prospective, multicenter validation is essential for broader clinical implementation.

## Full-text entities

- **Genes:** ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}, CD5 (CD5 molecule) [NCBI Gene 921] {aka LEU1, T1}, CX3CL1 (C-X3-C motif chemokine ligand 1) [NCBI Gene 6376] {aka ABCD-3, C3Xkine, CXC3, CXC3C, NTN, NTT}, SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, CRP (C-reactive protein) [NCBI Gene 1401] {aka PTX1}, KCNJ10 (potassium inwardly rectifying channel subfamily J member 10) [NCBI Gene 3766] {aka BIRK-10, KCNJ13-PEN, KIR1.2, KIR4.1, SESAME}, HK1 (hexokinase 1) [NCBI Gene 3098] {aka CNSHA5, HK, HK1-ta, HK1-tb, HK1-tc, HKD}, IL12B (interleukin 12B) [NCBI Gene 3593] {aka CLMF, CLMF2, IL-12B, IMD28, IMD29, NKSF}, SLC12A1 (solute carrier family 12 member 1) [NCBI Gene 6557] {aka BSC, BSC-1, BSC1, CCC2, NKCC2}
- **Diseases:** demyelination and axonal injury (MESH:D003711), neuroinflammation (MESH:D000090862), icterus (MESH:D007565), autoimmune disease (MESH:D001327), lipemia (MESH:D006949), neurologic diseases (MESH:D020271), hemolysis (MESH:D006461), axonal damage (MESH:D001480), inflammatory, infectious, and systemic diseases (MESH:D003141), neurologic disability (MESH:D009069), MODEL (MESH:D004195), inflammation (MESH:D007249), ML (MESH:D007859), OCB (MESH:D058745), MS (MESH:D009103)
- **Chemicals:** glucose (MESH:D005947), lactic acid (MESH:D019344), sodium (MESH:D012964), microalbumin (-), potassium (MESH:D011188), chloride (MESH:D002712)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12782304/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12782304/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC12782304/full.md

---
Source: https://tomesphere.com/paper/PMC12782304