Toward autocorrection of chemical process flowsheets using large language models
Lukas Schulze Balhorn, Marc Caballero, Artur M. Schweidtmann

TL;DR
This paper introduces a novel AI-based method using large language models to automatically detect and correct errors in chemical process flowsheets, aiming to improve safety and efficiency.
Contribution
It adapts LLMs for flowsheet autocorrection, demonstrating effective synthetic data training and promising accuracy in error correction tasks.
Findings
Top-1 accuracy of 80% on synthetic flowsheets
Top-5 accuracy of 84% on test dataset
Potential to assist chemical engineers in error correction
Abstract
The process engineering domain widely uses Process Flow Diagrams (PFDs) and Process and Instrumentation Diagrams (P&IDs) to represent process flows and equipment configurations. However, the P&IDs and PFDs, hereafter called flowsheets, can contain errors causing safety hazards, inefficient operation, and unnecessary expenses. Correcting and verifying flowsheets is a tedious, manual process. We propose a novel generative AI methodology for automatically identifying errors in flowsheets and suggesting corrections to the user, i.e., autocorrecting flowsheets. Inspired by the breakthrough of Large Language Models (LLMs) for grammatical autocorrection of human language, we investigate LLMs for the autocorrection of flowsheets. The input to the model is a potentially erroneous flowsheet and the output of the model are suggestions for a corrected flowsheet. We train our autocorrection model on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Software Engineering Research · Topic Modeling
