Applying Deep Reinforcement Learning to the HP Model for Protein   Structure Prediction

Kaiyuan Yang; Houjing Huang; Olafs Vandans; Adithya Murali; Fujia; Tian; Roland H.C. Yap; Liang Dai

arXiv:2211.14939·cs.LG·March 14, 2023·1 cites

Applying Deep Reinforcement Learning to the HP Model for Protein Structure Prediction

Kaiyuan Yang, Houjing Huang, Olafs Vandans, Adithya Murali, Fujia, Tian, Roland H.C. Yap, Liang Dai

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that deep reinforcement learning, specifically a deep Q-network with LSTM, can effectively find optimal protein conformations in the simplified HP lattice model, outperforming traditional heuristics.

Contribution

The study introduces a DRL approach using LSTM-enhanced DQN for the HP model, achieving state-of-the-art solutions without manual heuristics.

Findings

01

Successfully finds best-known conformations for benchmark sequences

02

LSTM-based DQN significantly improves learning and search efficiency

03

Can sample multiple optimal solutions per trial

Abstract

A central problem in computational biophysics is protein structure prediction, i.e., finding the optimal folding of a given amino acid sequence. This problem has been studied in a classical abstract model, the HP model, where the protein is modeled as a sequence of H (hydrophobic) and P (polar) amino acids on a lattice. The objective is to find conformations maximizing H-H contacts. It is known that even in this reduced setting, the problem is intractable (NP-hard). In this work, we apply deep reinforcement learning (DRL) to the two-dimensional HP model. We can obtain the conformations of best known energies for benchmark HP sequences with lengths from 20 to 50. Our DRL is based on a deep Q-network (DQN). We find that a DQN based on long short-term memory (LSTM) architecture greatly enhances the RL learning ability and significantly improves the search process. DRL can sample the state…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

compsoftmatterbiophysics-cityu-hk/applying-drl-to-hp-model-for-protein-structure-prediction
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsProtein Structure and Dynamics · Machine Learning in Bioinformatics · Genomics and Phylogenetic Studies

MethodsQ-Learning · Convolution · Dense Connections · Deep Q-Network