# Pathogenicity Prediction of Missense Variations in Hereditary Cancer Genes

**Authors:** Cemaliye B. Akyerli, Gizel Gerdan, Alper Bülbül, Hilal Keskin-Karakoyun, Şirin K. Yüksel, Emel Timucin

PMC · DOI: 10.3390/ijms27052453 · International Journal of Molecular Sciences · 2026-03-07

## TL;DR

HerCanPred is a machine learning tool that improves the classification of cancer-related genetic variants by using structural data from AlphaFold2.

## Contribution

HerCanPred integrates gene-specific training with 3D structural features to better predict pathogenicity in hereditary cancer genes.

## Key findings

- HerCanPred outperformed 23 existing predictors in classifying missense variants.
- 166 variants of uncertain significance were reclassified as pathogenic, and 75 as benign.
- Misclassifications were more common in disordered and surface-exposed regions of proteins.

## Abstract

HerCanPred, a machine-learning-based pathogenicity classifier specifically optimized for 63 cancer-predisposition genes, was developed to improve the interpretation of missense variants in hereditary cancer syndromes. This model integrates sequence conservation with structural features derived from AlphaFold2 (AF2) structures. HerCanPred achieved a strong performance, outperforming 23 established predictors. SHAP analysis identified AF2-derived structural features, specifically local pLDDT confidence scores and relative solvent accessible area, as the strongest predictors of variant impact. Benchmarking the strengths and limitations of HerCanPred against existing methods showed that misclassification of pathogenic variants was concentrated in disordered and surface-exposed regions, whereas benign failures were more broadly distributed. HerCanPred and three established predictors were also applied to over 57,000 variants of uncertain significance (VUS) from the same gene set. Notably, 166 VUS were reassigned as pathogenic and 75 as benign, with an enrichment of the NF1, FH, and MLH1 genes. By combining gene-specific training with 3D structural information, HerCanPred provides a robust framework for reducing diagnostic uncertainty. Our findings demonstrate that targeted, structure-aware tools can contribute to resolving VUS, providing a rational basis for systematic variant reinterpretation and more informed medical management in hereditary cancer care.

## Linked entities

- **Genes:** NF1 (neurofibromin 1) [NCBI Gene 4763], FH (fumarate hydratase) [NCBI Gene 2271], MLH1 (mutL homolog 1) [NCBI Gene 4292]

## Full-text entities

- **Genes:** NF1 (neurofibromin 1) [NCBI Gene 4763] {aka NFNS, VRNF, WSS}, MLH1 (mutL homolog 1) [NCBI Gene 4292] {aka COCA2, FCC2, HNPCC, HNPCC2, LYNCH2, MLH-1}
- **Diseases:** Hereditary Cancer (MESH:D009386), cancer (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12985416/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12985416/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12985416/full.md

---
Source: https://tomesphere.com/paper/PMC12985416