3-10 and Pi-Helices: Stochastic Events on Sequence Space; Reasons and Implications of their Accidental Occurrences across Protein Universe
Param Priya Singh, Anirban Banerji

TL;DR
This study analyzes the probabilistic occurrence of 3-10 and Pi-helices in proteins, revealing their random, rare, and evolutionarily transient nature, supported by statistical modeling and a new sequence comparison algorithm.
Contribution
It provides a probabilistic framework for understanding 3-10 and Pi-helices, introduces a new algorithm for sequence differentiation, and explores their evolutionary implications.
Findings
Occurrences follow Poisson distribution, indicating randomness.
Sequence intervals fit gamma and exponential distributions.
Higher presence of these helices in disallowed Ramachandran regions.
Abstract
Considering all available non-redundant protein structures across different structural classes, present study identified the probabilistic characteristics that describe several facets of the occurrence of 3(10) and Pi-helices in proteins. Occurrence profile of 3(10) and Pi-helices revealed that, their presence follows Poisson flow on the primary structure; implying that, their occurrence profile is rare, random and accidental. Structural class-specific statistical analyses of sequence intervals between consecutive occurrences of 3(10) and Pi-helices revealed that these could be best described by gamma and exponential distributions, across structural classes. Comparative study of normalized percentage of non-glycine and non-proline residues in 3(10), Pi and alpha-helices revealed a considerably higher proportion of 3(10) and Pi-helix residues in disallowed, generous and allowed regions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · RNA and protein synthesis mechanisms · Machine Learning in Bioinformatics
