# Causal splicing variants revealed by deep-learning integration of single-cell sQTL mapping under influenza infection

**Authors:** Liuyang Wang, Guinevere Connelly, Trisha Dalapati, Angela Jones, Benjamin Schott, Joseph Trimarco, Nicholas Heaton, Dennis Ko

PMC · DOI: 10.21203/rs.3.rs-8408992/v1 · Research Square · 2026-01-06

## TL;DR

This study identifies genetic variants that affect splicing and influence disease susceptibility, particularly during influenza infection.

## Contribution

The integration of deep learning and single-cell sQTL mapping reveals causal splicing variants and their effects on protein isoforms and disease traits.

## Key findings

- 76 likely causal variants affecting splicing components were identified, including rs2297616 in PARP2.
- The causal variant rs2297616 alters splicing and increases influenza A virus levels in cells.
- These variants are linked to over 100 GWAS traits, including autoimmune diseases.

## Abstract

Fulfilling the promise of human genetics in elucidating disease requires identifying causal variants and genes underlying genetic association signals. Molecular quantitative trait locus (molQTL) analyses, e.g. expression QTL (eQTL) and splicing QTL (sQTL), link genetic variants to intermediate molecular phenotypes, but pinpointing causal variants and their regulatory effects remains challenging. Here, we integrate sQTL analysis with deep-learning-based splicing effect annotation to identify causal genetic variants and elucidate their functional mechanisms affecting human phenotypes.

Using a single-cell GWAS method (scHi-HOST) on 96 lymphoblastoid cell lines (LCLs) with and without influenza A virus (IAV) infection, we discovered ~ 43,000 sQTLs associated with 217 genes after IAV infection. Integrating sQTLs with AI splice prediction, we uncovered 76 likely causal variants that affect cis-acting molecular splicing components (5’ donor, 3’ acceptor), supported by further computational analysis. Among these, we experimentally validated a causal sQTL signal affecting poly (ADP-ribose) polymerase 2 (PARP2). The causal variant, rs2297616, alters the 5’ splice donor site in the second intron of PARP2, resulting in two protein isoforms differing by 13 amino acids. The derived A allele was associated with the longer protein isoform and increased IAV levels in LCLs. CRISPR editing validated the causal effect of this variant on both protein length and IAV infection. Lastly, these 76 putative causal sQTLs were further linked to over a hundred GWAS traits, including many variants associated with autoimmune diseases.

Our work provides a catalog of causal sQTL with direct splicing impacts, providing causal mechanistic insights from genotype to disease susceptibility.

## Linked entities

- **Genes:** PARP2 (poly(ADP-ribose) polymerase 2) [NCBI Gene 10038]

## Full-text entities

- **Genes:** HSP90B1 (heat shock protein 90 beta family member 1) [NCBI Gene 7184] {aka ECGP, GP96, GRP94, HEL-S-125m, HEL35, TRA1}, MX1 (MX dynamin like GTPase 1) [NCBI Gene 4599] {aka IFI-78K, IFI78, MX, MxA, lncMX1-215}, IFITM1 (interferon induced transmembrane protein 1) [NCBI Gene 8519] {aka 9-27, CD225, DSPA2a, IFI17, LEU13}, PARP14 (poly(ADP-ribose) polymerase family member 14) [NCBI Gene 54625] {aka ARTD8, BAL2, PARP-14, pART8}, PARP1 (poly(ADP-ribose) polymerase 1) [NCBI Gene 142] {aka ADPRT, ADPRT 1, ADPRT1, ARTD1, PARP, PARP-1}, OAS1 (2'-5'-oligoadenylate synthetase 1) [NCBI Gene 4938] {aka E18/E16, IFI-4, IMD100, OIAS, OIASI}, PARP11 (poly(ADP-ribose) polymerase family member 11) [NCBI Gene 57097] {aka ARTD11, C12orf6, MIB006}, IFNA1 (interferon alpha 1) [NCBI Gene 3439] {aka IFL, IFN, IFN-ALPHA, IFN-alphaD, IFNA13, IFNA@}, OTUD6B-AS1 (OTUD6B antisense RNA 1) [NCBI Gene 100506365] {aka GS1-251I9.4}, PARP2 (poly(ADP-ribose) polymerase 2) [NCBI Gene 10038] {aka ADPRT2, ADPRTL2, ADPRTL3, ARTD2, PARP-2, pADPRT-2}, LGR5 (leucine rich repeat containing G protein-coupled receptor 5) [NCBI Gene 8549] {aka FEX, GPR49, GPR67, GRP49, HG38}, U2AF1 (U2 small nuclear RNA auxiliary factor 1) [NCBI Gene 7307] {aka FP793, RN, RNU2AF1, U2AF35, U2AFBP}, PARP12 (poly(ADP-ribose) polymerase family member 12) [NCBI Gene 64761] {aka ARTD12, MST109, MSTP109, ZC3H1, ZC3HDC1}, NAP1L4 (nucleosome assembly protein 1 like 4) [NCBI Gene 4676] {aka NAP1L4b, NAP2, NAP2L, hNAP2}, IFI44 (interferon induced protein 44) [NCBI Gene 10561] {aka MTAP44, TLDC5, p44}, ARTN (artemin) [NCBI Gene 9048] {aka ART, ENOVIN, EVN, NBN}, U2AF1L4 (U2 small nuclear RNA auxiliary factor 1 like 4) [NCBI Gene 199746] {aka U2AF1-RS3, U2AF1L3, U2AF1L3V1, U2AF1RS3, U2af26}, HLA-C (major histocompatibility complex, class I, C) [NCBI Gene 3107] {aka D6S204, HLA-JY3, HLAC, HLC-C, MHC, PSORS1}, IFNG (interferon gamma) [NCBI Gene 3458] {aka IFG, IFI, IMD69}, IFNAR2 (interferon alpha and beta receptor subunit 2) [NCBI Gene 3455] {aka IFN-R, IFN-R-2, IFN-alpha-REC, IFNABR, IFNARB, IMD45}, Parp2 (poly (ADP-ribose) polymerase 2) [NCBI Gene 290027] {aka Adprtl2}, ZC3HAV1 (zinc finger CCCH-type containing, antiviral 1) [NCBI Gene 56829] {aka ARTD13, FLB6421, PARP13, ZAP, ZC3H2, ZC3HDC2}
- **Diseases:** hepatitis B virus infection (MESH:D006509), infectious, autoimmune, and metabolic disorders (MESH:D003141), genetic disease (MESH:D030342), asthma (MESH:D001249), autoimmune diseases (MESH:D001327), herpes simplex virus 1 (HSV-1) infection (MESH:D006561), infected (MESH:D007239), HOST (MESH:D006086), multiple sclerosis (MESH:D009103), Epstein-Barr virus infection (MESH:D020031), hepatitis virus infection (MESH:D006525), CA09 IAV infection (MESH:D007251), viral (MESH:D014777), COVID-19 (MESH:D000086382), deaths (MESH:D003643)
- **Chemicals:** L-glutamine (MESH:D005973), DMEM (-), Penicillin-G (MESH:D010400), SDS (MESH:D012967), fatty acid (MESH:D005227), 7-AAD (MESH:C025942), NaF (MESH:D012969), Agarose (MESH:D012685), TPCK (MESH:D014108), HCl (MESH:D006851), ethidium bromide (MESH:D004996), CO2 (MESH:D002245), NAD + (MESH:D009243), PBS (MESH:D007854), NaCl (MESH:D012965), ADP-ribose (MESH:D000246), Streptomycin (MESH:D013307), TC (MESH:D013667), Triton X-100 (MESH:D017830), PVDF (MESH:C024865), sucrose (MESH:D013395)
- **Species:** Influenza A virus (no rank) [taxon 11320], Homo sapiens (human, species) [taxon 9606], Zika virus (no rank) [taxon 64320], West Nile virus (no rank) [taxon 11082], H1N1 subtype (serotype) [taxon 114727], Rattus norvegicus (brown rat, species) [taxon 10116], Human alphaherpesvirus 1 (Herpes simplex virus type 1, no rank) [taxon 10298], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Mus musculus (house mouse, species) [taxon 10090]
- **Mutations:** M0530S, rs2297616, rs10774671, rs17638853
- **Cell lines:** A549 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_0023), A549s — Mus musculus (Mouse), Hybridoma (CVCL_U609), MDCK — Canis lupus familiaris (Dog), Spontaneously immortalized cell line (CVCL_0422), D5 — Bos taurus (Bovine), Hybrid cell line (CVCL_9256)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12803322/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12803322/full.md

## References

91 references — full list in the complete paper: https://tomesphere.com/paper/PMC12803322/full.md

---
Source: https://tomesphere.com/paper/PMC12803322