Toward a multilevel representation of protein molecules: comparative   approaches to the aggregation/folding propensity problem

Lorenzo Livi; Alessandro Giuliani; Antonello Rizzi

arXiv:1407.7559·cs.CE·July 22, 2015

Toward a multilevel representation of protein molecules: comparative approaches to the aggregation/folding propensity problem

Lorenzo Livi, Alessandro Giuliani, Antonello Rizzi

PDF

TL;DR

This study compares multiple protein representations, including sequences and contact graphs, using classification methods to analyze folding and aggregation propensities in E. coli proteins, revealing key thresholds and features related to protein folding behavior.

Contribution

It introduces a multilevel protein representation approach combining sequence and graph data, and demonstrates its effectiveness in predicting folding and aggregation tendencies.

Findings

01

Identifies a threshold of around 250 residues for foldability.

02

Highlights the importance of contact graph spectra in folding discrimination.

03

Establishes statistically significant relationships between protein chemico-physical properties and folding behavior.

Abstract

This paper builds upon the fundamental work of Niwa et al. [34], which provides the unique possibility to analyze the relative aggregation/folding propensity of the elements of the entire Escherichia coli (E. coli) proteome in a cell-free standardized microenvironment. The hardness of the problem comes from the superposition between the driving forces of intra- and inter-molecule interactions and it is mirrored by the evidences of shift from folding to aggregation phenotypes by single-point mutations [10]. Here we apply several state-of-the-art classification methods coming from the field of structural pattern recognition, with the aim to compare different representations of the same proteins gathered from the Niwa et al. data base; such representations include sequences and labeled (contact) graphs enriched with chemico-physical attributes. By this comparison, we are able to identify…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.