mkite: A distributed computing platform for high-throughput materials simulations
Daniel Schwalbe-Koda

TL;DR
mkite is a Python-based distributed computing platform that streamlines high-throughput materials simulations across heterogeneous environments, enabling complex workflows and large-scale data management in materials science.
Contribution
The paper introduces mkite, a novel Python package that facilitates distributed high-throughput simulations with flexible workflows and decoupled data management, addressing limitations of existing software.
Findings
Supports complex, branched workflows for materials discovery
Enables scalable simulations across heterogeneous hardware
Demonstrated with case studies on zeolite synthesis and catalysts
Abstract
Advances in high-throughput simulation (HTS) software enabled computational databases and big data to become common resources in materials science. However, while computational power is increasingly larger, software packages orchestrating complex workflows in heterogeneous environments are scarce. This paper introduces mkite, a Python package for performing HTS in distributed computing environments. The mkite toolkit is built with the server-client pattern, decoupling production databases from client runners. When used in combination with message brokers, mkite enables any available client to perform calculations without prior hardware specification on the server side. Furthermore, the software enables the creation of complex workflows with multiple inputs and branches, facilitating the exploration of combinatorial chemical spaces. Software design principles are discussed in detail,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Scientific Computing and Data Management · Advanced Data Storage Technologies
