High-performance cloud computing for exhaustive protein-protein docking
Masahito Ohue, Kento Aoyama, Yutaka Akiyama

TL;DR
This paper demonstrates how cloud computing platforms like Azure can be effectively used for large-scale, high-performance protein-protein docking computations, showing strong scalability and cost efficiency.
Contribution
The study successfully ports MEGADOCK to Microsoft Azure, constructing a scalable HPC environment with up to 1,600 CPU cores and 960 GPUs, and evaluates its performance and cost benefits.
Findings
High scalability with strong scaling values of 0.93 (CPU) and 0.89 (GPU).
GPU instances significantly reduce computation time and costs.
The cloud environment is highly portable and suitable for large-scale bioinformatics applications.
Abstract
Public cloud computing environments, such as Amazon AWS, Microsoft Azure, and the Google Cloud Platform, have achieved remarkable improvements in computational performance in recent years, and are also expected to be able to perform massively parallel computing. As the cloud enables users to use thousands of CPU cores and GPU accelerators casually, and various software types can be used very easily by cloud images, the cloud is beginning to be used in the field of bioinformatics. In this study, we ported the original protein-protein interaction prediction (protein-protein docking) software, MEGADOCK, into Microsoft Azure as an example of an HPC cloud environment. A cloud parallel computing environment with up to 1,600 CPU cores and 960 GPUs was constructed using four CPU instance types and two GPU instance types, and the parallel computing performance was evaluated. Our MEGADOCK on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Computational Drug Discovery Methods · Genetics, Bioinformatics, and Biomedical Research
