Prior Knowledge based mutation prioritization towards causal variant finding in rare disease
Vasundhara Dehiya, Jaya Thomas, Lee Sael

TL;DR
This paper introduces a structure-based mutation prioritization method that leverages protein structural features to improve the identification of pathogenic variants in rare diseases, especially when statistical evidence is limited.
Contribution
It proposes a novel approach using protein structural features for mutation effect prediction, enhancing accuracy over sequence-only methods in rare disease contexts.
Findings
Structure-based features achieve high discernibility of pathogenic vs benign mutations.
Combining structure- and sequence-based features improves prediction accuracy.
Structure features are crucial for assessing rare variants in rare diseases.
Abstract
How do we determine the mutational effects in exome sequencing data with little or no statistical evidence? Can protein structural information fill in the gap of not having enough statistical evidence? In this work, we answer the two questions with the goal towards determining pathogenic effects of rare variants in rare disease. We take the approach of determining the importance of point mutation loci focusing on protein structure features. The proposed structure-based features contain information about geometric, physicochemical, and functional information of mutation loci and those of structural neighbors of the loci. The performance of the structure-based features trained on 80\% of HumDiv and tested on 20\% of HumDiv and on ClinVar datasets showed high levels of discernibility in the mutation's pathogenic or benign effects: F score of 0.71 and 0.68 respectively using multi-layer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Rare Diseases · Genomic variations and chromosomal abnormalities · Cancer Genomics and Diagnostics
