# Identification of immune and major depressive disorder-related diagnostic markers for early nonalcoholic fatty liver disease by WGCNA and machine learning

**Authors:** Yuyun Jia, Yanping Cao, Qin Yin, Xueqian Li, Xiu Wen

PMC · DOI: 10.3389/fbinf.2025.1594971 · Frontiers in Bioinformatics · 2025-06-26

## TL;DR

This study identifies immune and depression-related genes that can help detect early nonalcoholic fatty liver disease and improve targeted care for patients with both conditions.

## Contribution

A novel eight-gene signature linked to immune and depression pathways for early NAFLD diagnosis in comorbid patients.

## Key findings

- Fourteen genes were identified as strongly linked to both the immune system and MDD/NAFLD.
- An eight-gene signature showed high diagnostic accuracy for early-stage NAFLD.
- Immune cell infiltration patterns differed significantly between patients and healthy controls.

## Abstract

Major depressive disorder (MDD) and nonalcoholic fatty liver disease (NAFLD) are highly prevalent conditions that exhibit significant pathophysiological overlap, particularly in metabolic and immune pathways.

This study aims to bridge this gap by integrating transcriptomic data from publicly available repositories and advanced machine learning algorithms to identify novel biomarkers and construct a predictive model facilitates the provision of clinical psychological nursing interventions for early-stage NAFLD in MDD patients.

We systematically analyzed transcriptomic data of simple steatosis (SS), nonalcoholic steatohepatitis (NASH), and major depressive disorder (MDD) from GEO databases to construct and validate a diagnostic model. After removing batch effects, we identified differentially expressed genes (DEGs) that distinguished disease and control groups. We further applied Weighted Gene Co-expression Network Analysis (WGCNA) to identify immune-related genes in SS/NASH patients versus controls. The intersection of shared DEGs across both conditions and WGCNA-identified genes was determined and subjected to functional enrichment analysis. Immune cell infiltration levels were quantified using single-sample gene set enrichment analysis (ssGSEA). A predictive model for SS/NASH was developed by evaluating nine machine-learning algorithms with 10-fold cross-validation on the datasets.

Fourteen genes strongly linked to both the immune system and the two conditions were identified. Immune cell infiltration profiling revealed distinct immune landscapes in patients versus healthy controls. Moreover, an eight-gene signature was developed, demonstrating superior diagnostic accuracy in both testing and training cohorts. Notably, these eight genes were found to correlate with the severity of early-stage NAFLD.

This study established a predictive model for early-stage NAFLD through the integration of bioinformatics and machine learning approaches, with a focus on immune- and MDD-related genes. The eight-gene signature identified in this study represents a novel diagnostic tool for precision medicine, enabling targeted psychological nursing intervention in comorbid populations.

## Linked entities

- **Diseases:** Major depressive disorder (MONDO:0002009), Nonalcoholic fatty liver disease (MONDO:0013209), Nonalcoholic steatohepatitis (MONDO:0007027)

## Full-text entities

- **Diseases:** NAFLD (MESH:D065626), MDD (MESH:D003865), SS (MESH:D005234)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12271764/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12271764/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/PMC12271764/full.md

---
Source: https://tomesphere.com/paper/PMC12271764