# Decoding the Role of H19 in Cholestatic Liver Injury Using snRNA-seq, Spatial Transcriptomics, and Machine Learning-Based Disease Prediction

**Authors:** Grayson Welch Way, Xixian Jiang, Hongkun Lu, Nan Wu, Derrick Zhao, Yun-ling Tai, Sareh Bayatpour, Xuan Wang, Huiping Zhou

PMC · DOI: 10.21203/rs.3.rs-8339668/v1 · Research Square · 2026-01-14

## TL;DR

This study explores how the long non-coding RNA H19 contributes to liver disease progression and identifies its role in cholangiocyte dysfunction using advanced sequencing and machine learning.

## Contribution

The study reveals H19's role in sustaining pathogenic cholangiocyte states and identifies novel markers and signaling pathways in cholestatic liver injury.

## Key findings

- H19 deletion reduces disease-associated cholangiocyte expansion and restores bile-duct gene expression.
- SPP1 signaling is dysregulated in cholestatic injury and normalized with H19 deletion.
- Machine learning models achieved high accuracy in predicting disease states using cell-type-specific signatures.

## Abstract

Primary Sclerosing Cholangitis (PSC) is a chronic obstructive biliary disease and remains a high-burden cholestatic liver disease with no approved therapies and a substantial recurrence rate following liver transplantation. The long non-coding RNA H19 (H19) has emerged as a potential driver of PSC progression, yet its cell-type-specific and spatially resolved mechanisms remain poorly defined.

Age- and sex-matched wild type (WT), H19 knockout (H19KO), Mdr2 knockout (Mdr2KO), and double-knockout (DKO; Mdr2KO/H19KO) mice were used. The liver tissues were analyzed using single nucleus RNA sequencing (snRNAseq) and NanoString GeoMx spatial transcriptomics to elucidate H19-dependent cellular and spatial alternations in cholestatic liver injury. Machine learning models (logistic regression, XGBoost, neural network, and random forest) were developed to generate cell-type specific disease prediction signatures and validated using the publicly available human dataset GSE243981. Both spatial transcriptomics and snRNAseq identified a disease-associated cholangiocyte subcluster that was significantly expanded in Mdr2KO mice, but markedly diminished in DKO mice, demonstrating a requirement for H19 in sustaining pathogenic cholangiocyte state. SPP1 signaling was significantly dysregulated in cholestatic liver injury and ameliorated with H19 deletion. Novel murine markers were identified, including Gm13775 (healthy hepatocytes) and Clu and Spp1 (healthy cholangiocytes), all of which were markedly downregulaed in disease. Machine learning-based, cell type-specific disease prediction models achieved AUC values > 0.87 when validated in the GSE243981 human dataset. Noteably,Spp1 expression decreased in cholangiocytes but was ectopically upregulated in hepatocytes in diseased liver, highlighting disrupted intercellular signaling network. Spatial analyses showed that H19 deletion restored the disease-associated gene expression changes specifically within the bile duct region.

H19 deletion mitigates cholestatic injury by suppressing pathogenic cholangiocyte states, normalizing SPP1-mediated signaling, and restoring bile-duct-localized transcriptional programs. These findings position H19 as a critical regulator of cholangiocyte-driven pathology and a potential therapeutic target in PSC.

## Linked entities

- **Genes:** H19 (H19 imprinted maternally expressed transcript) [NCBI Gene 283120], SPP1 (secreted phosphoprotein 1) [NCBI Gene 6696], Gm13775 (predicted gene 13775) [NCBI Gene 102632173], CLU (clusterin) [NCBI Gene 1191]
- **Diseases:** Primary Sclerosing Cholangitis (MONDO:0013433)
- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Genes:** Sox9 (SRY (sex determining region Y)-box 9) [NCBI Gene 20682] {aka 2010306G03Rik, mKIAA4243, mSox9}, Apoa1 (apolipoprotein A-I) [NCBI Gene 11806] {aka Alp-1, Apoa-1, Brp-14, Ltw-1, Lvtw-1, Sep-1}, Malat1 (metastasis associated lung adenocarcinoma transcript 1 (non-coding RNA)) [NCBI Gene 72289] {aka 2210401K01Rik, 9430072K23Rik, Neat2}, Cd44 (CD44 antigen) [NCBI Gene 12505] {aka HERMES, Ly-24, Pgp-1}, Nfkb1 (nuclear factor of kappa light polypeptide gene enhancer in B cells 1, p105) [NCBI Gene 18033] {aka NF-KB1, NF-kappaB, NF-kappaB1, p105, p50, p50/p105}, Ahsg (alpha-2-HS-glycoprotein) [NCBI Gene 11625], Rpl13a (ribosomal protein L13A) [NCBI Gene 22121] {aka 1810026N22Rik, Tstap198-7, tum-antigen}, Slco3a1 (solute carrier organic anion transporter family, member 3a1) [NCBI Gene 108116] {aka 5830414C08Rik, Anr1, MJAM, OATP-D, Slc21a11}, Sox4 (SRY (sex determining region Y)-box 4) [NCBI Gene 20677] {aka Sox-4}, Frmd4b (FERM domain containing 4B) [NCBI Gene 232288] {aka 6030440G05Rik, GOBLIN, GRSP1}, Fga (fibrinogen alpha chain) [NCBI Gene 14161] {aka Fib}, Fasn (fatty acid synthase) [NCBI Gene 14104] {aka A630082H08Rik, FAS}, Apob (apolipoprotein B) [NCBI Gene 238055] {aka Apo B-100, apob-100, apob-48}, Gm13775 (predicted gene 13775) [NCBI Gene 102632173], Npas2 (neuronal PAS domain protein 2) [NCBI Gene 18143] {aka MOP4, bHLHe9}, Apoa2 (apolipoprotein A-II) [NCBI Gene 11807] {aka Alp-2, Apo-AII, ApoA-II, ApoAII, Apoa-2, Hdl-1}, Itgb6 (integrin beta 6) [NCBI Gene 16420] {aka 2210409C20Rik, 4831415H04Rik}, Tnfrsf12a (tumor necrosis factor receptor superfamily, member 12a) [NCBI Gene 27279] {aka Fn14, HPIP, TWEAK-R, TweakR}, Apoa5 (apolipoprotein A-V) [NCBI Gene 66113] {aka 1300007O05Rik, Apoav, RAP3}, Ces1c (carboxylesterase 1C) [NCBI Gene 13884] {aka Ces-N, Ee-1, Ee1, Es-4, Es-N, Es1}, Cd74 (CD74 antigen (invariant polypeptide of major histocompatibility complex, class II antigen-associated)) [NCBI Gene 16149] {aka CLIP, DHLAG, HLADG, Ia-GAMMA, Ii}, Anxa3 (annexin A3) [NCBI Gene 11745] {aka Anx3}, Hspa8 (heat shock protein family A (Hsp70) member 8) [NCBI Gene 15481] {aka 2410008N15Rik, Hsc70, Hsc71, Hsc73, Hsp73, Hspa10}, Lama4 (laminin, alpha 4) [NCBI Gene 16775], Usp2 (ubiquitin specific peptidase 2) [NCBI Gene 53376] {aka B930035K21Rik, Ubp41}, Rnf128 (ring finger protein 128) [NCBI Gene 66889] {aka 1300002C13Rik, GRAIL, Greul1}, Cftr (cystic fibrosis transmembrane conductance regulator) [NCBI Gene 12638] {aka Abcc7}, Xist (inactive X specific transcripts) [NCBI Gene 213742] {aka A430022B11}, Rhbdf2 (rhomboid 5 homolog 2) [NCBI Gene 217344] {aka 4732465I17Rik, Rhbdl6, Uncv, cub}, Wwc1 (WW, C2 and coiled-coil domain containing 1) [NCBI Gene 211652] {aka Kibra}, Cdh1 (cadherin 1) [NCBI Gene 12550] {aka ARC-1, E-cad, Ecad, L-CAM, UVO, Um}, 1600012H06Rik (RIKEN cDNA 1600012H06 gene) [NCBI Gene 67912], Mat1a (methionine adenosyltransferase 1A) [NCBI Gene 11720] {aka AdoMet, Ams, MAT, MATA1, SAMS, SAMS1}, Itgav (integrin alpha V) [NCBI Gene 16410] {aka 1110004F14Rik, 2610028E01Rik, CD51, D430040G12Rik}, Spp1 (secreted phosphoprotein 1) [NCBI Gene 20750] {aka 2AR, Apl-1, BNSP, BSPI, Bsp, ETA-1}, Hspa2 (heat shock protein family A (Hsp70) member 2) [NCBI Gene 15512] {aka 70kDa, HSP70.2, HSP70A2, Hsp70-2}, Abcb4 (ATP-binding cassette, sub-family B member 4) [NCBI Gene 18670] {aka Mdr2, Pgy-2, Pgy2, mdr-2}, Gnas (GNAS complex locus) [NCBI Gene 14683] {aka 5530400H20Rik, A930027G11Rik, C130027O20Rik, GPSA, GSP, Galphas}, Ikbkb (inhibitor of kappaB kinase beta) [NCBI Gene 16150] {aka IKK-2, IKK-B, IKK-beta, IKK2, IKK[b], IKKbeta}, Fn1 (fibronectin 1) [NCBI Gene 14268] {aka E330027I09, Fn, Fn-1}, Hspa5 (heat shock protein family A (Hsp70) member 5) [NCBI Gene 14828] {aka Bip, D2Wsu141e, D2Wsu17e, Grp78, Hsce70, SEZ-7}, Csmd1 (CUB and Sushi multiple domains 1) [NCBI Gene 94109] {aka B930082H09, mKIAA1890}, Itgb1 (integrin beta 1 (fibronectin receptor beta)) [NCBI Gene 16412] {aka 4633401G24Rik, CD29, Fnrb, Gm9863, gpIIa}, Gphn (gephyrin) [NCBI Gene 268566] {aka 5730552E08Rik, C230040D23, GPH, GPHRYN, geph}, Tsix (X (inactive)-specific transcript, opposite strand) [NCBI Gene 22097], Hsp90aa1 (heat shock protein 90, alpha (cytosolic), class A member 1) [NCBI Gene 15519] {aka 86kDa, 89kDa, Hsp86-1, Hsp89, Hsp90, Hspca}, Clu (clusterin) [NCBI Gene 12759] {aka ApoJ, Cli, D14Ucla3, SP-40, Sgp-2, Sgp2}, Gbp9 (guanylate-binding protein 9) [NCBI Gene 236573], Gm19951 [NCBI Gene 100503902], Hamp (hepcidin antimicrobial peptide) [NCBI Gene 84506] {aka Hamp1, Hepc, Hepc1}, Clec2d (C-type lectin domain family 2, member d) [NCBI Gene 93694] {aka Clr-b, Clrb, Ocil}, Agmo (alkylglycerol monooxygenase) [NCBI Gene 319660] {aka A530016O06Rik, Tmem195}, Sorbs3 (sorbin and SH3 domain containing 3) [NCBI Gene 20410] {aka SCAM-1, SH3P3, Sh3d4, vinexin-g}, Itga4 (integrin alpha 4) [NCBI Gene 16401] {aka CD49D, Itga4B}, Srsf1 (serine and arginine-rich splicing factor 1) [NCBI Gene 110809] {aka 1110054N12Rik, 5730507C05Rik, 6330415C05Rik, Asf, Sf2, Sfrs1}, Ly6e (lymphocyte antigen 6 family member E) [NCBI Gene 17069] {aka Ly67, RIG-E, Sca-2, TSA-1, Tsa1}, Ugt2b5 (UDP glucuronosyltransferase 2 family, polypeptide B5) [NCBI Gene 22238] {aka M-1, Udpgt-3, Udpgt2b5, Ugt2b17}, H19 (H19, imprinted maternally expressed transcript) [NCBI Gene 14955] {aka EyeLinc6}, Chka (choline kinase alpha) [NCBI Gene 12660] {aka CK, CK/EK-alpha, Chetk-alpha, Chk, ChoK, EK}, Eef1a1 (eukaryotic translation elongation factor 1 alpha 1) [NCBI Gene 13627]
- **Diseases:** PSC (MESH:D015209), cholestasis (MESH:D002779), liver fibrosis (MESH:D008103), biliary fibrosis (MESH:D005355), HCC (MESH:D006528), malignancies (MESH:D009369), Primary Biliary Cholangitis (MESH:D008105), hepatobiliary disease (MESH:D004066), immune dysregulation (OMIM:614878), Cholestatic Liver Injury (MESH:D017093), Crohn's disease (MESH:D003424), DKO (OMIM:615441), inflammation of the intrahepatic and extrahepatic bile ducts (MESH:D001651), tumorigenesis (MESH:D063646), pancreatic ductal adenocarcinoma (MESH:D021441), Multidrug resistance 2 (MESH:D018088), colorectal carcinoma (MESH:D015179), end-stage liver failure (MESH:D007676), Cholangiocyte disease (MESH:D004194), chronic obstructive biliary disease (MESH:D029424), biliary atresia (MESH:D001656), cholestatic liver disease (MESH:D008107), gallbladder carcinoma (MESH:D005706), cholangiocarcinoma (MESH:D018281), DSP (MESH:D008569)
- **Chemicals:** Alexa Fluor (-), lipid (MESH:D008055), nitrogen (MESH:D009584), cholesterol (MESH:D002784), bile acid (MESH:D001647)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** C57BL/6J — Mus musculus (Mouse), Transformed cell line (CVCL_C0MW)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12869644/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12869644/full.md

---
Source: https://tomesphere.com/paper/PMC12869644