# SOX7: Autism associated gene identified by analysis of multi-Omics data

**Authors:** Samantha Gonzales, Jane Zizhen Zhao, Na Young Choi, Prabha Acharya, Sehoon Jeong, Xuexia Wang, Moo-Yeal Lee, Chunyu Liu, Chunyu Liu, Chunyu Liu, Chunyu Liu

PMC · DOI: 10.1371/journal.pone.0320096 · PLOS One · 2025-05-15

## TL;DR

This study identifies SOX7 as an autism-associated gene by combining DNA and RNA data from large datasets, suggesting it could help in diagnosing and treating autism.

## Contribution

The study identifies SOX7 as a novel autism-associated gene through integrated multi-Omics analysis of GWAS and RNA-seq data.

## Key findings

- SOX7 was significantly associated with ASD in both discovery and replication datasets.
- SOX7 showed differential expression in RNA-seq data, being upregulated in ASD cases.
- SOX7 is a transcription factor that may contribute to autism through protein complex interactions.

## Abstract

Genome-wide association studies and next generation sequencing data analyses based on DNA information have identified thousands of mutations associated with autism spectrum disorder (ASD). However, more than 99% of identified mutations are non-coding. Thus, it is unclear which of these mutations might be functional and thus potentially causal variants. Transcriptomic profiling using total RNA-sequencing has been one of the most utilized approaches to link protein levels to genetic information at the molecular level. The transcriptome captures molecular genomic complexity that the DNA sequence solely does not. Some mutations alter a gene’s DNA sequence but do not necessarily change expression and/or protein function. To date, few common variants reliably associated with the diagnosis status of ASD despite consistently high estimates of heritability. In addition, reliable biomarkers used to diagnose ASD or molecular mechanisms to define the severity of ASD do not exist. Therefore, it is necessary to integrate DNA and RNA testing together to identify true causal genes and propose useful biomarkers for ASD. We performed gene-based association studies with adaptive test using genome-wide association studies’ (GWAS) summary statistics with two large GWAS datasets (ASD 2019 data: 18,382 ASD cases and 27,969 controls [discovery data]; ASD 2017 data: 6,197 ASD cases and 7,377 controls [replication data]) which were obtained from the Psychiatric Genomics Consortium (PGC). In addition, we investigated differential expression between ASD cases and controls for genes identified in gene-based GWAS with two RNA-seq datasets (GSE211154: 20 cases and 19 controls; GSE30573: 3 cases and 3 controls). We identified 5 genes significantly associated with ASD in ASD 2019 data (KIZ-AS1, p = 8.67 × 10-10; KIZ, p = 1.16 × 10-9; XRN2, p = 7.73 × 10-9; SOX7, p = 2.22 × 10-7; LOC101929229 also known as PINX1-DT, p = 2.14 × 10-6). Among these 5 genes, gene SOX7 (p = 0.00087) and LOC101929229 (p = 0.009) were replicated in ASD 2017 data. KIZ-AS1 (p = 0.059) and KIZ (p = 0.06) were close to the boundary of replication in ASD 2017 data. Genes SOX7 (p = 0.036 in all samples; p = 0.044 in white samples) indicated significant expression differences between cases and controls in the GSE211154 RNA-seq data. Furthermore, gene SOX7 was upregulated in cases than in controls in the GSE30573 RNA-seq data (p = 0.0017; Benjamini-Hochberg adjusted p = 0.0085). SOX7 encodes a member of the SOX (SRY-related HMG-box) family of transcription factors pivotally contributing to determining of the cell fate and identity in many lineages. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins leading to autism. Gene SOX7 in the transcription factor family could be associated with ASD. This finding may provide new diagnostic and therapeutic strategies for ASD.

## Linked entities

- **Genes:** SOX7 (SRY-box transcription factor 7) [NCBI Gene 83595], KIZ-AS1 (KIZ antisense RNA 1) [NCBI Gene 101929591], KIZ (kizuna centrosomal protein) [NCBI Gene 55857], XRN2 (5'-3' exoribonuclease 2) [NCBI Gene 22803], PINX1-DT (PINX1 divergent transcript) [NCBI Gene 101929229]
- **Diseases:** autism spectrum disorder (MONDO:0005258), ASD (MONDO:0006664)

## Full-text entities

- **Genes:** PINX1-DT (PINX1 divergent transcript) [NCBI Gene 101929229], MACROD2 (mono-ADP ribosylhydrolase 2) [NCBI Gene 140733] {aka C20orf133, C2orf133}, ASTN2 (astrotactin 2) [NCBI Gene 23245] {aka bA67K19.1}, KIZ-AS1 (KIZ antisense RNA 1) [NCBI Gene 101929591], PLA2G4A (phospholipase A2 group IVA) [NCBI Gene 5321] {aka GURDP, PLA2G4, cPLA2, cPLA2-alpha}, SHANK3 (SH3 and multiple ankyrin repeat domains 3) [NCBI Gene 85358] {aka DEL22q13.3, PROSAP2, PSAP2, SCZD15, SPANK-2}, HDAC4 (histone deacetylase 4) [NCBI Gene 9759] {aka AHO3, BDMR, HA6116, HD4, HDAC-4, HDAC-A}, CLCN5 (Cl-/H+ antiporter 5) [NCBI Gene 1184] {aka CLC5, CLCK2, ClC-5, DENT1, DENTS, NPHL1}, NLGN3 (neuroligin 3) [NCBI Gene 54413] {aka HNL3}, CNTN4 (contactin 4) [NCBI Gene 152330] {aka AXCAM, BIG-2}, GRIK2 (glutamate ionotropic receptor kainate type subunit 2) [NCBI Gene 2898] {aka EAA4, GLR6, GLUK6, GLUR6, GluK2, MRT6}, NLGN4X (neuroligin 4 X-linked) [NCBI Gene 57502] {aka ASPGX2, AUTSX2, HLNX, HNL4X, NLGN4}, RIMS4 (regulating synaptic membrane exocytosis 4) [NCBI Gene 140730] {aka C20orf190, RIM 4, RIM-4, RIM4, RIM4-gamma, RIM4gamma}, SRY (sex determining region Y) [NCBI Gene 6736] {aka SRXX1, SRXY1, TDF, TDY}, MECP2 (methyl-CpG binding protein 2) [NCBI Gene 4204] {aka AUTSX3, MRX16, MRX79, MRXS13, MRXSL, PPMX}, FOXP1 (forkhead box P1) [NCBI Gene 27086] {aka 12CC4, HSPC215, MFH, QRF1, hFKH1B}, KALRN (kalirin RhoGEF kinase) [NCBI Gene 8997] {aka ARHGEF24, CHDS5, DUET, DUO, HAPIP, KALNC2}, NINL (ninein like) [NCBI Gene 22981] {aka NLP}, SHH (sonic hedgehog signaling molecule) [NCBI Gene 6469] {aka HHG1, HLP3, HPE3, MCOPCB5, SMMCI, ShhNC}, ASMT (acetylserotonin O-methyltransferase) [NCBI Gene 438] {aka ASMTY, HIOMT, HIOMTY}, NRXN1 (neurexin 1) [NCBI Gene 9378] {aka Hs.22998, PTHSL2, SCZD17}, Qsox1 (quiescin Q6 sulfhydryl oxidase 1) [NCBI Gene 104009] {aka 1300003H02Rik, QSOX, Qscn6, SOx, b2b2673Clo}, XRN2 (5'-3' exoribonuclease 2) [NCBI Gene 22803], Sox7 (SRY (sex determining region Y)-box 7) [NCBI Gene 20680], PINX1 (PIN2 (TERF1) interacting telomerase inhibitor 1) [NCBI Gene 54984] {aka Gno1, LPTL, LPTS, Pxr1}, KIZ (kizuna centrosomal protein) [NCBI Gene 55857] {aka C20orf19, HT013, Kizuna, NCRNA00153, PLK1S1, RP69}, ADNP (activity dependent neuroprotector homeobox) [NCBI Gene 23394] {aka ADNP1, HVDAS, MRD28}, EXT1 (exostosin glycosyltransferase 1) [NCBI Gene 2131] {aka EXT, LGCR, LGS, TRPS2, TTV}, SOX7 (SRY-box transcription factor 7) [NCBI Gene 83595]
- **Diseases:** pervasive developmental disorder (MESH:D002659), neurological disease (MESH:D020271), delay of motor and speech development (MESH:D007805), glioma (MESH:D005910), epilepsy (MESH:D004827), HGG (MESH:D008228), Tourette Syndrome (MESH:D005879), neurological disorders (MESH:D009461), ASD (MESH:D000067877), developmental delay (MESH:D002658), ADHD (MESH:D001289), schizophrenia (MESH:D012559), vascular defects (MESH:D057772), congenital heart defects (MESH:D006330), hypoxic (MESH:D002534), cancer (MESH:D009369), Autism (MESH:D001321), death (MESH:D003643), OCD (MESH:D009771), Asperger's syndrome (MESH:D020817), deficiencies in cerebellar function (MESH:D002526), social communication deficits (MESH:D003147), ID (MESH:D008607), AT (MESH:D013736), repetitive compulsive (MESH:D003193), 22q11 deletion (MESH:D058165), gastrointestinal problems (MESH:D012817)
- **Chemicals:** Chunyu (-)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Mutations:** rs7836366, rs4841432, rs10100209, rs7005905, rs7009920
- **Cell lines:** GSE211154 — Konosirus punctatus (Dotted gizzard shad), Spontaneously immortalized cell line (CVCL_6F81)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12080844/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12080844/full.md

## References

123 references — full list in the complete paper: https://tomesphere.com/paper/PMC12080844/full.md

---
Source: https://tomesphere.com/paper/PMC12080844