Deep Supervised and Convolutional Generative Stochastic Network for Protein Secondary Structure Prediction
Jian Zhou, Olga G. Troyanskaya

TL;DR
This paper introduces a deep supervised convolutional generative stochastic network for protein secondary structure prediction, achieving state-of-the-art accuracy by leveraging hierarchical deep learning and convolutional architecture.
Contribution
The work presents a novel supervised GSN extension with convolutional architecture tailored for high-dimensional protein data, improving prediction accuracy over previous methods.
Findings
Achieved 66.4% Q8 accuracy on CB513 dataset.
Outperformed previous best accuracy of 64.9%.
Demonstrated effectiveness of deep hierarchical convolutional models for protein structure prediction.
Abstract
Predicting protein secondary structure is a fundamental problem in protein structure prediction. Here we present a new supervised generative stochastic network (GSN) based method to predict local secondary structure with deep hierarchical representations. GSN is a recently proposed deep learning technique (Bengio & Thibodeau-Laufer, 2013) to globally train deep generative model. We present the supervised extension of GSN, which learns a Markov chain to sample from a conditional distribution, and applied it to protein structure prediction. To scale the model to full-sized, high-dimensional data, like protein sequences with hundreds of amino acids, we introduce a convolutional architecture, which allows efficient learning across multiple layers of hierarchical representations. Our architecture uniquely focuses on predicting structured low-level labels informed with both low and high-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics
