Graph identification of proteins in tomograms (GRIP-Tomo)
August George, Doo Nam Kim, Trevor Moser, Ian T. Gildea, James E., Evans, Margaret S. Cheung

TL;DR
This paper introduces a novel graph-based pattern mining method for identifying proteins in tomograms that is robust to data distortions and does not rely on predefined templates, improving interpretability and classification accuracy.
Contribution
The study presents a topological graph-based approach for protein identification in tomograms that is invariant to common data distortions and does not require prior templates or human bias.
Findings
Accurately identified proteins from simulated tomograms with distortions.
Demonstrated robustness to missing data, tumbling, and missing wedge effects.
Enabled classification of proteins based on topological features.
Abstract
In this study, we present a method of pattern mining based on network theory that enables the identification of protein structures or complexes from synthetic volume densities, without the knowledge of predefined templates or human biases for refinement. We hypothesized that the topological connectivity of protein structures is invariant, and they are distinctive for the purpose of protein identification from distorted data presented in volume densities. Three-dimensional densities of a protein or a complex from simulated tomographic volumes were transformed into mathematical graphs as observables. We systematically introduced data distortion or defects such as missing fullness of data, the tumbling effect, and the missing wedge effect into the simulated volumes, and varied the distance cutoffs in pixels to capture the varying connectivity between the density cluster centroids in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Topological and Geometric Data Analysis · Data Visualization and Analytics
