# Identification of biomarkers between coronary artery disease and non-alcoholic steatohepatitis: a combination of bioinformatics and machine learning

**Authors:** Yihong Lin, Jingmei Song, Xiaohong Li

PMC · DOI: 10.3389/fgene.2025.1573621 · Frontiers in Genetics · 2025-07-17

## TL;DR

This study identifies shared genes and pathways between coronary artery disease and non-alcoholic steatohepatitis, suggesting potential biomarkers and common mechanisms.

## Contribution

The study combines bioinformatics and machine learning to discover novel shared biomarkers and mechanisms between CAD and NASH.

## Key findings

- BATF3, SOCS2, and GPER are key genes with ROC values above 0.7, validated in external datasets.
- Shared genes are enriched in insulin resistance and inflammation pathways.
- The genes are linked to metabolic, inflammatory, and cardiovascular pathways via GSEA and ssGSEA.

## Abstract

Non-alcoholic steatohepatitis (NASH) commonly complicates coronary artery disease (CAD), yet the interaction mechanism remains unclear. Our research seeks to investigate the common mechanisms and key signature genes between CAD and NASH.

RNA sequence information for CAD and NASH was screened from the GEO database. Weighted gene co-expression network analysis (WGCNA) and differentially expressed gene analysis identified key genes, followed by functional enrichment analysis of these shared genes. Three machine learning methods—LASSO, random forest, and SVM-RFE—were used to identify signature genes. Gene set enrichment analysis (GSEA) was then performed to explore potential mechanisms associated with the signature genes. In addition, single-sample gene set enrichment analysis (ssGSEA) evaluated immune infiltration in CAD and NASH and its correlation with the signature genes.

WGCNA has revealed two key modules for CAD and NASH. The intersection of the CAD modules and their differential genes narrowed the key genes down to 2,808 shared genes. Finally, 44 shared genes were selected for both CAD and NASH. Kyoto Encyclopedia of Genes and Genomes analysis showed that these genes were primarily enriched in insulin resistance and inflammation pathways. Machine learning identified the signature genes BATF3, SOCS2, and GPER, all with ROC values above 0.7, validated in external datasets. GSEA revealed that these genes act through common mechanisms in CAD and NASH, regulating metabolic, inflammatory, and cardiovascular pathways. In addition, ssGSEA suggested their involvement in immune cell infiltration.

BATF3, SOCS2, and GPER have emerged as promising gene candidates that may serve as biomarkers or potential therapeutic targets for CAD combined with NASH, linked to the regulation of metabolic, inflammatory, and cardiovascular pathways. We also identified insulin resistance and inflammation pathways as common mechanisms underlying both diseases.

## Linked entities

- **Genes:** BATF3 (basic leucine zipper ATF-like transcription factor 3) [NCBI Gene 55509], SOCS2 (suppressor of cytokine signaling 2) [NCBI Gene 8835], GPER1 (G protein-coupled estrogen receptor 1) [NCBI Gene 2852]
- **Diseases:** coronary artery disease (MONDO:0005010), non-alcoholic steatohepatitis (MONDO:0007027)

## Full-text entities

- **Genes:** PKD2 (polycystin 2, transient receptor potential cation channel) [NCBI Gene 5311] {aka APKD2, PC2, PKD4, Pc-2, TRPP2}, Gper1 (G protein-coupled estrogen receptor 1) [NCBI Gene 76854] {aka 6330420K13Rik, CMKRL2, Ceprl, FEG-1, GPCR-Br, Gper}, TGFB1 (transforming growth factor beta 1) [NCBI Gene 7040] {aka CAEND1, CED, DPD1, IBDIMDE, LAP, TGF-beta1}, ST3GAL4 (ST3 beta-galactoside alpha-2,3-sialyltransferase 4) [NCBI Gene 6484] {aka CGS23, NANTA3, SAT3, SIAT4, SIAT4C, ST-4}, MMP9 (matrix metallopeptidase 9) [NCBI Gene 4318] {aka CLG4B, GELB, MANDP2, MMP-9}, INS (insulin) [NCBI Gene 3630] {aka IDDM, IDDM1, IDDM2, ILPR, IRDN, MODY10}, TNF (tumor necrosis factor) [NCBI Gene 7124] {aka DIF, IMD127, TNF-alpha, TNFA, TNFSF2, TNLG1F}, STAT2 (signal transducer and activator of transcription 2) [NCBI Gene 6773] {aka IMD44, ISGF-3, P113, PTORCH3, STAT113}, GCKR (glucokinase regulator) [NCBI Gene 2646] {aka FGQTL5, GKRP}, CRP (C-reactive protein) [NCBI Gene 1401] {aka PTX1}, APOE (apolipoprotein E) [NCBI Gene 348] {aka AD2, APO-E, ApoE4, LDLCQ5, LPG}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, ESR1 (estrogen receptor 1) [NCBI Gene 2099] {aka ER, ESR, ESRA, ESTRR, Era, NR3A1}, NFKB1 (nuclear factor kappa B subunit 1) [NCBI Gene 4790] {aka CVID12, EBP-1, KBF1, NF-kB, NF-kB1, NF-kappa-B1}, CD36 (CD36 molecule (CD36 blood group)) [NCBI Gene 948] {aka BDPLT10, CHDS7, FAT, GP3B, GP4, GPIV}, PTK2B (protein tyrosine kinase 2 beta) [NCBI Gene 2185] {aka CADTK, CAKB, FADK2, FAK2, PKB, PTK}, CCL19 (C-C motif chemokine ligand 19) [NCBI Gene 6363] {aka CKb11, ELC, MIP-3b, MIP3B, SCYA19}, APOB (apolipoprotein B) [NCBI Gene 338] {aka FCHL2, FLDB, LDLCQ4, apoB-100, apoB-48}, PNPLA3 (patatin like domain 3, 1-acylglycerol-3-phosphate O-acyltransferase) [NCBI Gene 80339] {aka ADPN, C22orf20, iPLA(2)epsilon}, GPER1 (G protein-coupled estrogen receptor 1) [NCBI Gene 2852] {aka CEPR, CMKRL2, DRY12, FEG-1, GPCR-Br, GPER}, JAK2 (Janus kinase 2) [NCBI Gene 3717] {aka JTK10}, BATF3 (basic leucine zipper ATF-like transcription factor 3) [NCBI Gene 55509] {aka JDP1, JUNDM1, SNFT}, FSTL3 (follistatin like 3) [NCBI Gene 10272] {aka FLRG, FSRP}, STAT3 (signal transducer and activator of transcription 3) [NCBI Gene 6774] {aka ADMIO, ADMIO1, APRF, HIES}, VEGFA (vascular endothelial growth factor A) [NCBI Gene 7422] {aka L-VEGF, MVCD1, VEGF, VPF}, TM6SF2 (transmembrane 6 superfamily member 2) [NCBI Gene 53345], TIPE1 (TNF alpha induced protein 8 like 1) [NCBI Gene 126282] {aka TNFAIP8L1}, SOCS2 (suppressor of cytokine signaling 2) [NCBI Gene 8835] {aka CIS2, Cish2, SOCS-2, SSI-2, SSI2, STATI2}, ITGAE (integrin subunit alpha E) [NCBI Gene 3682] {aka CD103, HUMINAE}, CD1B (CD1b molecule) [NCBI Gene 910] {aka CD1, R1}, MLXIPL (MLX interacting protein like) [NCBI Gene 51085] {aka CHREBP, MIO, MONDOB, WBSCR14, WS-bHLH, bHLHd14}, PLCXD3 (phosphatidylinositol specific phospholipase C X domain containing 3) [NCBI Gene 345557], CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}
- **Diseases:** hypertension (MESH:D006973), carotid lesion (MESH:D002340), NAFLD (MESH:D065626), CVD (MESH:D002318), dyslipidemia (MESH:D050171), necrosis (MESH:D009336), liver disease (MESH:D008107), liver fibrosis (MESH:D008103), hepatic lipid (MESH:D011017), CAD (MESH:D003324), obesity (MESH:D009765), Acute myocardial infarction (MESH:D009203), cardiac muscle contraction (MESH:C536214), hepatocellular carcinoma (MESH:D006528), liver condition (MESH:D017093), fatty degeneration (MESH:D008067), hepatocyte injury (MESH:D014947), inflammation (MESH:D007249), NASH (MESH:D005235), type 2 diabetes (MESH:D003924), cirrhosis (MESH:D005355), atherosclerosis (MESH:D050197), Endothelial dysfunction (MESH:D014652), fatty liver disease (MESH:D005234), atherosclerotic plaques (MESH:D058226), metabolic syndrome (MESH:D024821), fat (MESH:D004620), dilated cardiomyopathy (MESH:D002311), diabetes (MESH:D003920), insulin resistance (MESH:D007333)
- **Chemicals:** cholesterol esters (MESH:D002788), chondroitin sulfate (MESH:D002809), aminoacyl-tRNA (MESH:D012346), arginine (MESH:D001120), riboflavin (MESH:D012256), alcohol (MESH:D000438), CoA (MESH:D003065), lipid (MESH:D008055), glycosaminoglycan (MESH:D006025), glycolipid (MESH:D006017), O-glycan (-), proline (MESH:D011392), glycan (MESH:D011134), homocysteine (MESH:D006710), cholesterol (MESH:D002784)
- **Species:** Rattus norvegicus (brown rat, species) [taxon 10116], Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** I148M, serine/threonine

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12310482/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12310482/full.md

## References

63 references — full list in the complete paper: https://tomesphere.com/paper/PMC12310482/full.md

---
Source: https://tomesphere.com/paper/PMC12310482