# Machine Learning-Based Validation of LDHC and SLC35G2 Methylation as Epigenetic Biomarkers for Food Allergy

**Authors:** Sabire Kiliçarslan, Meliha Merve Hiz Çiçekliyurt, Serhat Kiliçarslan, Dina S. M. Hassan, Nagwan Abdel Samee, Ahmet Kurtoglu

PMC · DOI: 10.3390/biomedicines13102489 · Biomedicines · 2025-10-13

## TL;DR

This study uses machine learning and DNA methylation data to identify LDHC and SLC35G2 as potential biomarkers for accurately diagnosing food allergies.

## Contribution

The novel integration of machine learning and deep learning with DNA methylation data identifies and validates LDHC and SLC35G2 as epigenetic biomarkers for food allergy.

## Key findings

- LDHC and SLC35G2 methylation patterns are promising biomarkers for distinguishing food allergies from sensitivities.
- Machine learning models combined with deep learning improve diagnostic accuracy and uncover epigenetic patterns.
- Validation on an external dataset confirms the reproducibility of the biomarkers across independent cohorts.

## Abstract

Background: Food allergies represent a growing global health concern, yet the current diagnostic methods often fail to distinguish between true allergies and food sensitivities, leading to misdiagnoses and inadequate treatment. Epigenetic alterations, such as DNA methylation (DNAm), may offer novel biomarkers for precise diagnosis. Methods: This study employed a computational machine learning framework integrated with DNAm data to identify potential biomarkers and enhance diagnostic accuracy. Differential methylation analysis was performed using the limma package to identify informative CpG features, which were then analyzed with advanced algorithms, including SVM (polynomial and RBF kernels), k-NN, Random Forest, and artificial neural networks (ANN). Deep learning via a stacked autoencoder (SAE) further enriched the analysis by uncovering epigenetic patterns and reducing feature dimensionality. To ensure robustness, the identified biomarkers were independently validated using the external dataset GSE114135. Results: The hybrid machine learning models revealed LDHC and SLC35G2 methylation as promising biomarkers for food allergy prediction. Notably, the methylation pattern of the LDHC gene showed significant potential in distinguishing individuals with food allergies from those with food sensitivity. Additionally, the integration of machine learning and deep learning provided a robust platform for analyzing complex epigenetic data. Importantly, validation on GSE114135 confirmed the reproducibility and reliability of these findings across independent cohorts. Conclusions: This study demonstrates the potential of combining machine learning with DNAm data to advance precision medicine in food allergy diagnosis. The results highlight LDHC and SLC35G2 as robust epigenetic biomarkers, validated across two independent datasets (GSE114134 and GSE114135). These findings underscore the importance of developing clinical tests that incorporate these biomarkers to reduce misdiagnosis and lay the groundwork for exploring epigenetic regulation in allergic diseases.

## Linked entities

- **Genes:** LDHC (lactate dehydrogenase C) [NCBI Gene 3948], SLC35G2 (solute carrier family 35 member G2) [NCBI Gene 80723]
- **Diseases:** food allergy (MONDO:0700226)

## Full-text entities

- **Genes:** LDHC (lactate dehydrogenase C) [NCBI Gene 3948] {aka CT32, LDH3, LDHX}, SLC35G2 (solute carrier family 35 member G2) [NCBI Gene 80723] {aka TMEM22}
- **Diseases:** allergic diseases (MESH:D004342), Food Allergy (MESH:D005512)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12561535/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12561535/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/PMC12561535/full.md

---
Source: https://tomesphere.com/paper/PMC12561535