# Assessing the performance of 28 pathogenicity prediction methods on rare single nucleotide variants in coding regions

**Authors:** Jee Yeon Heo, Ju Han Kim

PMC · DOI: 10.1186/s12864-025-11787-4 · BMC Genomics · 2025-07-07

## TL;DR

This study evaluates 28 methods for predicting the harmfulness of rare genetic variants, finding that methods using allele frequency and conservation perform best.

## Contribution

The study provides a comprehensive performance comparison of 28 pathogenicity prediction methods on rare variants using the latest ClinVar dataset.

## Key findings

- MetaRNN and ClinPred showed the highest predictive power on rare variants by incorporating conservation, other scores, and allele frequency.
- Most methods had lower specificity than sensitivity, and performance metrics declined as allele frequency decreased.
- About 10% of variants had missing prediction scores, highlighting limitations in current methods.

## Abstract

Accurate pathogenicity prediction of rare variants in coding regions is crucial for prioritizing candidate variants in human diseases and advancing personalized precision medicine. Although many prediction methods have been developed, it remains unclear how they perform specifically on rare variants.

In this study, the performance of 28 pathogenicity prediction methods was assessed using the latest ClinVar dataset, with a focus on rare variants and various allele frequency (AF) ranges. Ten evaluation metrics were employed to comprehensively assess the predictive performance of each method. The methods were selected based on their training approaches, including whether the training dataset was filtered by AF and whether AF was incorporated as a feature. Most methods focused on missense and start-lost variants, covering only a subset of nonsynonymous SNVs. The average missing rate of approximately 10% was observed in these variants, indicating that prediction scores were unavailable for them. MetaRNN and ClinPred, which incorporated conservation, other prediction scores, and AFs as features, demonstrated the highest predictive power on rare variants. For most methods, specificity was lower than sensitivity. Across various AF ranges, most performance metrics tended to decline as AF decreased, with specificity showing a particularly large decline.

These results provide insights into the strengths and limitations of each method in predicting the pathogenicity of rare variants, which may guide future improvements in predictive models. Furthermore, while AF and existing prediction scores offer valuable information for prediction methods, the identification of novel biological features is essential to overcome current limitations and further improve predictive performance.

The online version contains supplementary material available at 10.1186/s12864-025-11787-4.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12235850/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12235850/full.md

## References

1 references — full list in the complete paper: https://tomesphere.com/paper/PMC12235850/full.md

---
Source: https://tomesphere.com/paper/PMC12235850