# Single cell RNA sequencing analysis and molecular biology experiments identify NAFLD-related fibroblasts and highlights seven hub genes in NAFLD

**Authors:** Xuechun Zhang, Ying Cheng, Runzhi Yu, Xiaona Hu, Yiqin Huang

PMC · DOI: 10.1371/journal.pone.0332881 · PLOS One · 2026-02-02

## TL;DR

This study identifies a specific fibroblast population and seven key genes linked to nonalcoholic fatty liver disease (NAFLD), offering insights into its progression and potential diagnostic markers.

## Contribution

The study introduces a novel genetic profile of seven hub genes associated with NAFLD pathogenesis, identified through single-cell RNA sequencing and machine learning.

## Key findings

- A distinct fibroblast subset was identified in NAFLD patients, expressing genes related to disease progression.
- Seven hub genes were identified and validated in vivo and in vitro, showing significant expression differences in NAFLD models.
- The genes are functionally linked to the cell cycle and immune infiltration, suggesting roles in NAFLD development.

## Abstract

The Nonalcoholic fatty liver disease (NAFLD) is a major chronic liver condition, with its pathology remaining elusive. Consequently, a thorough exploration of the underlying mechanisms driving NAFLD progression is essential for a comprehensive understanding of this metabolism-associated disorder.

The scRNA-seq data from liver specimens of healthy and NAFLD subjects from GSE182365 was examined in the current study. High-dimensional weighted gene co-expression network analysis and seven machine learning (ML) approaches were employed to identify and confirm a genetic profile that precisely detects this population. Subsequently, we utilized MCP counter and non-negative matrix factorization to explore immune cell infiltration patterns. Lastly, the comparative expression of seven optimal feature genes in vivo and in vitro were confirmed through wet experiments.

Single-cell sequencing findings unveiled a distinct fibroblast subset, differentially infiltrated in NAFLD and control cohorts, expressing genes linked to NAFLD pathogenesis. HdWGCNA and ML algorithms screened and identified the essential signature defining this cohort, comprising 7 genes, with function enrichment analysis indicating their strong link to the cell cycle pathway. Notably, we established a connection between these 7 genes and immune cell infiltration and patient variability. Concurrently, wet experiments illustrated the actual expression disparities of our seven genes in NAFLD and normal models both in vivo and in vitro.

Our investigation has uncovered a distinct fibroblast population closely associated with NAFLD pathogenesis. The related genes, including CCL2, CRYAB, HSBP1, IFIT3, PSMB10, RBX1, and SPARC, with potential promoting or inhibitory roles in NAFLD pathogenesis, were subsequently identified and represented a crucial step in developing prospective clinical diagnostic method for this condition.

## Linked entities

- **Genes:** CCL2 (C-C motif chemokine ligand 2) [NCBI Gene 6347], CRYAB (crystallin alpha B) [NCBI Gene 1410], HSBP1 (heat shock factor binding protein 1) [NCBI Gene 3281], IFIT3 (interferon induced protein with tetratricopeptide repeats 3) [NCBI Gene 3437], PSMB10 (proteasome 20S subunit beta 10) [NCBI Gene 5699], RBX1 (ring-box 1) [NCBI Gene 9978], SPARC (secreted protein acidic and cysteine rich) [NCBI Gene 6678]
- **Diseases:** Nonalcoholic fatty liver disease (MONDO:0013209), NAFLD (MONDO:0013209)

## Full-text entities

- **Genes:** PBX1 (PBX homeobox 1) [NCBI Gene 5087] {aka CAKUHED}, CCL2 (C-C motif chemokine ligand 2) [NCBI Gene 6347] {aka GDCF-2, HC11, HSMCR30, MCAF, MCP-1, MCP1}, Sparc (secreted acidic cysteine rich glycoprotein) [NCBI Gene 20692] {aka BM-40, ON}, Cryab (crystallin, alpha B) [NCBI Gene 12955] {aka Crya-2, Crya2, HspB5, P23}, CXCL8 (C-X-C motif chemokine ligand 8) [NCBI Gene 3576] {aka GCP-1, GCP1, IL8, LECT, LUCT, LYNAP}, Psmb10 (proteasome (prosome, macropain) subunit, beta type 10) [NCBI Gene 19171] {aka Mecl-1, Mecl1}, IFIT3 (interferon induced protein with tetratricopeptide repeats 3) [NCBI Gene 3437] {aka CIG-49, GARG-49, IFI60, IFIT4, IRG2, ISG60}, Bcl2 (B cell leukemia/lymphoma 2) [NCBI Gene 12043] {aka Bcl-2, C430015F12Rik, D630044D05Rik, D830018M01Rik}, CRYAB (crystallin alpha B) [NCBI Gene 1410] {aka CMD1II, CRYA2, CTPP2, CTRCT16, HEL-S-101, HSPB5}, KRAS (KRAS proto-oncogene, GTPase) [NCBI Gene 3845] {aka 'C-K-RAS, C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A}, Hsbp1 (heat shock factor binding protein 1) [NCBI Gene 68196] {aka 0610007A03Rik, Hsp25}, Ifit3 (interferon-induced protein with tetratricopeptide repeats 3) [NCBI Gene 15959] {aka Ifi49, P49}, SPARC (secreted protein acidic and cysteine rich) [NCBI Gene 6678] {aka BM-40, OI17, ON, ONT}, CCL5 (C-C motif chemokine ligand 5) [NCBI Gene 6352] {aka D17S136E, RANTES, SCYA5, SIS-delta, SISd, TCP228}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, RBX1 (ring-box 1) [NCBI Gene 9978] {aka BA554C12.1, RNF75, ROC1}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, CXCL12 (C-X-C motif chemokine ligand 12) [NCBI Gene 6387] {aka IRH, PBSF, SCYB12, SDF1, TLSF, TPAR1}, Casp3 (caspase 3) [NCBI Gene 12367] {aka A830040C14Rik, AC-3, CASP-3, CC3, CPP-32, CPP32}, Ccr2 (C-C motif chemokine receptor 2) [NCBI Gene 12772] {aka Cc-ckr-2, Ccr2a, Ccr2b, Ckr2, Ckr2a, Ckr2b}, Ccl2 (C-C motif chemokine ligand 2) [NCBI Gene 20296] {aka HC11, JE, MCAF, MCP-1, MCP1, SMC-CF}, Rb1cc1 (RB1-inducible coiled-coil 1) [NCBI Gene 12421] {aka 2900055E04Rik, 5930404L04Rik, Cc1, FIP200, LaXp180}, IL2 (interleukin 2) [NCBI Gene 3558] {aka IL-2, TCGF, lymphokine}, STAT5A (signal transducer and activator of transcription 5A) [NCBI Gene 6776] {aka MGF, STAT5}, CD46 (CD46 molecule) [NCBI Gene 4179] {aka AHUS2, MCP, MIC10, TLX, TRA2.10}, PSMB10 (proteasome 20S subunit beta 10) [NCBI Gene 5699] {aka IMD121, LMP10, MECL1, PRAAS5, beta2i}, HSBP1 (heat shock factor binding protein 1) [NCBI Gene 3281] {aka NPC-A-13}, Rbx1 (ring-box 1) [NCBI Gene 56438] {aka 1500002P15Rik, ROC1}, GAPDH (glyceraldehyde-3-phosphate dehydrogenase) [NCBI Gene 2597] {aka G3PD, GAPD, HEL-S-162eP}, Bax (BCL2-associated X protein) [NCBI Gene 12028], Cd14 (CD14 antigen) [NCBI Gene 12475], Hsf1 (heat shock factor 1) [NCBI Gene 15499] {aka HSTF, HSTF 1, Hsf1alpha, Hsf1beta}, Atg13 (autophagy related 13) [NCBI Gene 51897] {aka 1110053A20Rik, D2Ertd391e, Harbi1}, Pbx1 (pre B cell leukemia homeobox 1) [NCBI Gene 18514] {aka 2310056B04Rik, D230003C07Rik, Pbx-1}
- **Diseases:** metabolic abnormalities (MESH:D008659), inflammation (MESH:D007249), dyslipidemia (MESH:D050171), MAFLD (MESH:D005234), pain (MESH:D010146), weight loss (MESH:D015431), weakness (MESH:D018908), metabolic syndrome (MESH:D024821), NAFLD (MESH:D065626), hepatocyte damage (MESH:D020263), Fanconi anemia (MESH:D005199), Staphylococcus aureus infection (MESH:D013203), liver fibrosis (MESH:D008103), cirrhosis (MESH:D005355), hepatocellular carcinomas (MESH:D006528), Amoebiasis (MESH:D000562), pertussis (MESH:D014917), T2DM (MESH:D003924), hypertension (MESH:D006973), liver condition (MESH:D017093), CD (MESH:D003424), infections (MESH:D007239), FGs (MESH:C537297), obesity (MESH:D009765), African trypanosomiasis (MESH:D014353)
- **Chemicals:** hypotaurine (MESH:C003949), TG (MESH:D014280), palmitic acid (MESH:D019308), folate (MESH:D005492), water (MESH:D014867), isoflurane (MESH:D007530), CO2 (MESH:D002245), paraffin (MESH:D010232), cholesterol (MESH:D002784), pentose (MESH:D010429), oleic acid (MESH:D019301), bile acid (MESH:D001647), Oil Red O (MESH:C011049), PA (MESH:D011478), taurine (MESH:D013654), glucuronate (MESH:D020723), PVDF (MESH:C024865), H&amp;E (MESH:D006371), OA (MESH:D019319), paraformaldehyde (MESH:C003043), carbon (MESH:D002244), penicillin (MESH:D010406), streptomycin (MESH:D013307), pentose phosphate (MESH:D010428), ascorbate (MESH:D001205), FFAs (MESH:D005230), fat (MESH:D005223), nicotine (MESH:D009538), DMEM (-), lipid (MESH:D008055), TRIzol (MESH:C411644), SDS (MESH:D012967)
- **Species:** gut metagenome (species) [taxon 749906], Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** LX2 — Homo sapiens (Human), Transformed cell line (CVCL_5792), fibroblasts — Mus musculus (Mouse), Spontaneously immortalized cell line (CVCL_0594)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12863513/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12863513/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/PMC12863513/full.md

---
Source: https://tomesphere.com/paper/PMC12863513