Deep Learning of Protein Structural Classes: Any Evidence for an 'Urfold'?
Menuka Jaiswal, Saad Saleem, Yonghyeon Kweon, Eli J Draizen, Stella, Veretnik, Cameron Mura, Philip E. Bourne

TL;DR
This paper employs deep learning models on protein structures to explore and potentially redefine protein classification schemes, revealing new insights into protein relationships beyond traditional methods.
Contribution
The study introduces a novel deep learning approach using convolutional autoencoders to analyze protein domains and propose a new perspective on protein classification and interrelationships.
Findings
Deep learning models can learn structural features defining protein superfamilies.
Pairwise distance matrices reveal relationships beyond geometric similarity.
Hierarchical clustering offers a new view of protein interrelationships.
Abstract
Recent computational advances in the accurate prediction of protein three-dimensional (3D) structures from amino acid sequences now present a unique opportunity to decipher the interrelationships between proteins. This task entails--but is not equivalent to--a problem of 3D structure comparison and classification. Historically, protein domain classification has been a largely manual and subjective activity, relying upon various heuristics. Databases such as CATH represent significant steps towards a more systematic (and automatable) approach, yet there still remains much room for the development of more scalable and quantitative classification methods, grounded in machine learning. We suspect that re-examining these relationships via a Deep Learning (DL) approach may entail a large-scale restructuring of classification schemes, improved with respect to the interpretability of distant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
