Protein language model rescue mutations highlight variant effects and structure in clinically relevant genes
Onuralp Soylemez, Pablo Cordero

TL;DR
This paper demonstrates that protein language models can identify rescue mutations that reveal structural features and variant effects in clinically relevant genes, surpassing traditional structure predictors.
Contribution
It introduces a systematic approach using protein language models to find compensatory mutations, uncovering structural insights missed by other methods.
Findings
Protein language models predict impactful rescue mutations.
Rescue mutations reveal new structural features.
Community efforts can improve clinical mutation predictions.
Abstract
Despite being self-supervised, protein language models have shown remarkable performance in fundamental biological tasks such as predicting impact of genetic variation on protein structure and function. The effectiveness of these models on diverse set of tasks suggests that they learn meaningful representations of fitness landscape that can be useful for downstream clinical applications. Here, we interrogate the use of these language models in characterizing known pathogenic mutations in curated, medically actionable genes through an exhaustive search of putative compensatory mutations on each variant's genetic background. Systematic analysis of the predicted effects of these compensatory mutations reveal unappreciated structural features of proteins that are missed by other structure predictors like AlphaFold. While deep mutational scan experiments provide an unbiased estimate of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · Genomics and Rare Diseases · Genomics and Phylogenetic Studies
MethodsAlphaFold
