Leveraging Large Language Models to Build and Execute Computational   Workflows

Alejandro Duque; Abdullah Syed; Kastan V. Day; Matthew J. Berry,; Daniel S. Katz; Volodymyr V. Kindratenko

arXiv:2312.07711·cs.AI·December 14, 2023·1 cites

Leveraging Large Language Models to Build and Execute Computational Workflows

Alejandro Duque, Abdullah Syed, Kastan V. Day, Matthew J. Berry,, Daniel S. Katz, Volodymyr V. Kindratenko

PDF

Open Access

TL;DR

This paper investigates how large language models can be used to automatically generate and execute complex scientific workflows, reducing the need for traditional coding.

Contribution

It introduces a strategy for integrating LLMs with workflow management systems, exemplified by initial experiments with Phyloflow and OpenAI's API.

Findings

01

Successful initial integration with Phyloflow and OpenAI API

02

Proposed framework for LLM-driven scientific workflows

03

Potential to simplify complex scientific computations

Abstract

The recent development of large language models (LLMs) with multi-billion parameters, coupled with the creation of user-friendly application programming interfaces (APIs), has paved the way for automatically generating and executing code in response to straightforward human queries. This paper explores how these emerging capabilities can be harnessed to facilitate complex scientific workflows, eliminating the need for traditional coding methods. We present initial findings from our attempt to integrate Phyloflow with OpenAI's function-calling API, and outline a strategy for developing a comprehensive workflow management system based on these concepts.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Topic Modeling · Distributed and Parallel Computing Systems