GPU accelerated enumeration and exploration of HP model genotype-phenotype maps for protein folding
S. Owen

TL;DR
This paper demonstrates how GPU acceleration enables large-scale enumeration of the HP model's genotype-phenotype map for protein folding, revealing new insights into the connectivity and complexity of the landscape.
Contribution
It introduces GPU techniques to efficiently perform large-scale enumeration of the HP model, surpassing CPU methods and enabling exploration of larger lattices and more detailed GP maps.
Findings
GPU achieved 580-700x speedup over CPU
Largest enumeration of 6x6 lattice performed
Data suggests a connected 'spaghetti' network rather than disconnected 'plum-pudding' metaphor
Abstract
Evolution can be broadly described in terms of mutations of the genotype and the subsequent selection of the phenotype. The full enumeration of a given genotype-phenotype (GP) map is therefore a powerful technique in examining evolutionary landscapes. However, because the number of genotypes typically grows exponentially with genome length, such calculations rapidly become intractable. Here I apply graphics processing unit(GPU) techniques to the hydrophobic-polar (HP)model for protein folding. This GP map is a simple and well-studied model for the complex process of protein folding. Prior studies on relatively small 2D and 3D lattices have been exclusively carried out using conventional central processing unit (CPU) approaches. By using GPU techniques, I was able to reproduce the pioneering calculations of Li et al.[1] with a speed up of 580-700 fold over a CPU. I was also able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Machine Learning in Bioinformatics · Algorithms and Data Compression
