# Deep-embedded clustering by relevant scales and genome-wide association study in autism

**Authors:** Fumihiko Ueno, Ippei Takahashi, Hisashi Ohseto, Tomomi Onuma, Akira Narita, Taku Obara, Mami Ishikuro, Keiko Murakami, Aoi Noda, Fumiko Matsuzaki, Hirohito Metoki, Gen Tamiya, Shigeo Kure, Shinichi Kuriyama, Mohith Manjunath, Mohith Manjunath, Mohith Manjunath

PMC · DOI: 10.1371/journal.pone.0322698 · PLOS One · 2025-05-29

## TL;DR

This study explores the genetic basis of autism by clustering patients and performing genome-wide association studies, finding multiple significant loci linked to autism-related genes.

## Contribution

The novel use of cluster-based GWAS in autism research identifies new genetic loci that may be relevant to ASD.

## Key findings

- Cluster-based GWAS identified 27 chromosomal loci associated with autism at p < 5.0 × 10⁻⁸.
- Several identified loci are near genes previously linked to autism spectrum disorder.
- Results were not fully replicable across different populations, suggesting potential technical variability.

## Abstract

Autism spectrum disorder (ASD) presents with heterogeneous phenotypic and genetic characteristics. Despite investigation into the molecular mechanisms underlying ASD, its etiology remains elusive. In our previous investigation within the Simons Simplex Collection (SSC), we noted increased signals through a genome-wide association study (GWAS) by clustering patients with ASD and reducing the sample size. This study seeks to validate our previous study in a different population, the Simons Foundation Powering Autism Research for Knowledge (SPARK) population, while probing further into the genetic architecture of ASD. We examined data from 2,079 white male subjects and 875 unaffected SPARK siblings. Our methodology encompassed cluster analyses, followed by traditional GWAS and cluster-based GWAS (cGWAS). No significant associations were observed in the conventional GWAS when comparing all patients with all controls. However, in the cGWAS, by comparing patients clustered by phenotypes with controls, we identified 27 chromosomal loci meeting the criteria of p < 5.0 × 10 ⁻ 8. Remarkably, several of these loci were situated within or in proximity to genes previously implicated as candidates for ASD. Nonetheless, our previous study of the SSC population did not fully replicate the SPARK population. The absence of reproducibility suggests the possibility of false positives within the cGWAS results due to potential technical factors. However, the emergence of multiple signals post-clustering and the association of numerous identified gene regions with ASD and related disorders provide supporting evidence for the validity of cGWAS outcomes.

## Linked entities

- **Diseases:** autism spectrum disorder (MONDO:0005258), ASD (MONDO:0006664)

## Full-text entities

- **Diseases:** ASD (MESH:D000067877), Autism (MESH:D001321)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12122041/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12122041/full.md

## References

94 references — full list in the complete paper: https://tomesphere.com/paper/PMC12122041/full.md

---
Source: https://tomesphere.com/paper/PMC12122041