# Identifying individuals with rare disease variants by inferring shared ancestral haplotypes from SNP array data

**Authors:** Erandee Robertson, Bronwyn E Grinton, Karen L Oliver, Liam G Fearnley, Michael S Hildebrand, Lynette G Sadleir, Ingrid E Scheffer, Samuel F Berkovic, Mark F Bennett, Melanie Bahlo

PMC · DOI: 10.1093/nargab/lqaf033 · NAR Genomics and Bioinformatics · 2025-04-04

## TL;DR

FoundHaplo is a new method that identifies rare disease variants by detecting shared ancestral genetic patterns using SNP data.

## Contribution

FoundHaplo improves sensitivity in detecting rare disease-causing variants compared to genome-wide imputation using SNP array data.

## Key findings

- FoundHaplo successfully inferred two rare pathogenic variants in epilepsy-related genes.
- The method outperformed genome-wide imputation in identifying rare variants with low minor allele frequency.
- FoundHaplo is applicable to nonhuman and non-disease traits, including quantitative trait analysis.

## Abstract

We describe FoundHaplo, an identity-by-descent algorithm that can be used to screen untyped disease-causing variants using single nucleotide polymorphism (SNP) array data. FoundHaplo leverages knowledge of shared disease haplotypes for inherited variants to identify those who share the disease haplotype and are, therefore, likely to carry the rare [minor allele frequency (MAF) ≤ 0.01%] variant. We performed a simulation study to evaluate the performance of FoundHaplo across 33 disease-harbouring loci. FoundHaplo was used to infer the presence of two rare (MAF ≤ 0.01%) pathogenic variants, SCN1B c.363C>G (p.Cys121Trp) and WWOX c.49G>A (p.E17K), which can cause mild dominant and severe recessive epilepsy, respectively, in the Epi25 cohort and the UK Biobank. FoundHaplo demonstrated substantially better sensitivity at inferring the presence of these rare variants than existing genome-wide imputation. FoundHaplo is a valuable screening tool for searching disease-causing variants with known founder effects using only SNP genotyping data. It is also applicable to nonhuman applications and nondisease-causing traits, including rare-variant drivers of quantitative traits. The FoundHaplo algorithm is available at https://github.com/bahlolab/FoundHaplo (DOI:10.5281/zenodo.8058286).

Graphical Abstract

## Linked entities

- **Genes:** SCN1B (sodium voltage-gated channel beta subunit 1) [NCBI Gene 6324], WWOX (WW domain containing oxidoreductase) [NCBI Gene 51741]
- **Diseases:** epilepsy (MONDO:0005027)

## Full-text entities

- **Genes:** SCN1B (sodium voltage-gated channel beta subunit 1) [NCBI Gene 6324] {aka ATFB13, BRGDA5, DEE52, EIEE52, GEFSP1}, WWOX (WW domain containing oxidoreductase) [NCBI Gene 51741] {aka D16S432E, DEE28, EIEE28, FOR, FRA16D, HHCMA56}
- **Diseases:** epilepsy (MESH:D004827)
- **Mutations:** p.Cys121Trp, p.E17K

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11970371/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11970371/full.md

## References

66 references — full list in the complete paper: https://tomesphere.com/paper/PMC11970371/full.md

---
Source: https://tomesphere.com/paper/PMC11970371