Identifying critical residues of a protein using meaningfully-thresholded Random Geometric Graphs
Chuqiao Zhang, Sarath Chandra Dantu, Debarghya Mitra, Dalia Chakrabarty

TL;DR
This paper introduces novel methods using thresholded Random Geometric Graphs to identify critical residues in proteins, validated against experimental data, with potential applications in understanding protein function.
Contribution
It develops three new RGG-based approaches for detecting critical residues, incorporating organic thresholding and dynamic analysis during protein evolution.
Findings
RGG-based methods outperform traditional thresholding techniques
Critical residues identified align well with experimental data
Dynamic analysis reveals residue importance changes over time
Abstract
Identification of critical residues of a protein is actively pursued, since such residues are essential for protein function. We present three ways of recognising critical residues of an example protein, the evolution of which is tracked via molecular dynamical simulations. Our methods are based on learning a Random Geometric Graph (RGG) variable, where the state variable of each of 156 residues, is attached to a node of this graph, with the RGG learnt using the matrix of correlations between state variables of each residue-pair. Given the categorical nature of the state variable, correlation between a residue pair is computed using Cramer's V. We advance an organic thresholding to learn an RGG, and compare results against extant thresholding techniques, when parametrising criticality as the nodal degree in the learnt RGG. Secondly, we develop a criticality measure by ranking the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Machine Learning in Bioinformatics · Bioinformatics and Genomic Networks
