# PANGEN: an online platform for the comparison and creation of diagnostic gene panels

**Authors:** Ofer Isakov, Dina Marek-Yagel, Rotem Greenberg, Michal Naftali, Shay Ben-Shachar

PMC · DOI: 10.1093/database/baae065 · Database: The Journal of Biological Databases and Curation · 2024-07-23

## TL;DR

PANGEN is a web platform that helps compare and create diagnostic gene panels by collecting gene-phenotype associations from multiple sources.

## Contribution

PANGEN introduces a centralized platform for comparing and generating diagnostic gene panels using curated gene-phenotype associations and genomic data.

## Key findings

- PANGEN collected data from 12 sources, including 1533 panels, 9759 phenotypes, and 6979 genes.
- The platform allows users to generate new panels and stratify them into core and extended genes based on phenotype associations.
- Pairwise panel similarity is calculated to facilitate comparison between gene panels.

## Abstract

Targeted gene panel sequencing is used to limit the search for causative genetic variants solely to genes with an established association with the phenotype. The design of gene panels is challenging due to the lack of consensus regarding phenotypic associations for some genes, which results in high variation in gene composition for the same panel offered by different laboratories. We developed PANGEN, a platform that provides a centralized resource for gene panel information, with the ability to compare and generate new intelligent diagnostic panels. Gene–phenotype associations were collected from 12 public and commercial sources (Blueprint, Cegat, Centogene, ClinGen, Fulgent, GeneDx, Health in Code, Human Phenotype Ontology, Invitae, PanelApp, Prevention genetics, and Pronto diagnostics). Gene–phenotype associations are categorized into tiers according to categories derived from the original source panel. Pairwise panel similarity was calculated by dividing the number of common genes by the total number of genes in both panels. Regions with extreme guanine-cytosine (GC) content were collected from the Genome in a Bottle stratifications dataset, and putative genomic duplications were retrieved from the University of Santa Cruz database. Overall, 1533 panels, 9759 phenotypes, and 6979 genes were collected. The platform provides an interface to (i) explore and compare collected panels, (ii) find similar panels, (iii) identify genes with high GC content or duplication levels, (iv) generate gene panels by combining panels from various sources, and (v) stratify a generated panel into genes with a strong phenotype association (‘core’) and those with a weaker association (‘extended’). The presented platform represents a unique resource for gene panel exploration and comparison that facilitates the generation of tailored diagnostic panels through a public online web server.

Database URL: https://c-gc.shinyapps.io/PANGEN/

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11265858/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11265858/full.md

## References

9 references — full list in the complete paper: https://tomesphere.com/paper/PMC11265858/full.md

---
Source: https://tomesphere.com/paper/PMC11265858