Towards Human-Level Understanding of Complex Process Engineering   Schematics: A Pedagogical, Introspective Multi-Agent Framework for   Open-Domain Question Answering

Sagar Srinivas Sakhinana; Geethan Sannidhi; Venkataramana Runkana

arXiv:2409.00082·cs.CL·September 4, 2024

Towards Human-Level Understanding of Complex Process Engineering Schematics: A Pedagogical, Introspective Multi-Agent Framework for Open-Domain Question Answering

Sagar Srinivas Sakhinana, Geethan Sannidhi, Venkataramana Runkana

PDF

Open Access

TL;DR

This paper introduces a secure, multi-agent, open-source framework for understanding complex process diagrams in chemical industries, enhancing privacy, explainability, and customization over proprietary AI models.

Contribution

It presents a novel hierarchical multi-agent RAG framework using open-source models and ReAct prompting for improved open-domain question answering on process diagrams.

Findings

01

Validated effectiveness through rigorous experiments

02

Outperforms existing approaches in accuracy and privacy

03

Enables domain-specific customization on consumer hardware

Abstract

In the chemical and process industries, Process Flow Diagrams (PFDs) and Piping and Instrumentation Diagrams (P&IDs) are critical for design, construction, and maintenance. Recent advancements in Generative AI, such as Large Multimodal Models (LMMs) like GPT4 (Omni), have shown promise in understanding and interpreting process diagrams for Visual Question Answering (VQA). However, proprietary models pose data privacy risks, and their computational complexity prevents knowledge editing for domain-specific customization on consumer hardware. To overcome these challenges, we propose a secure, on-premises enterprise solution using a hierarchical, multi-agent Retrieval Augmented Generation (RAG) framework for open-domain question answering (ODQA) tasks, offering enhanced data privacy, explainability, and cost-effectiveness. Our novel multi-agent framework employs introspective and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBusiness Process Modeling and Analysis · Multi-Agent Systems and Negotiation · Semantic Web and Ontologies