Agentic Large Language Models for Conceptual Systems Engineering and Design

Soheyl Massoudi; Mark Fuge

arXiv:2507.08619·cs.AI·November 4, 2025

Agentic Large Language Models for Conceptual Systems Engineering and Design

Soheyl Massoudi, Mark Fuge

PDF

TL;DR

This study compares multi-agent and two-agent LLM workflows in early engineering design, demonstrating that structured multi-agent systems improve design detail and workflow management, though challenges remain in requirement coverage and code fidelity.

Contribution

The paper introduces a structured multi-agent system for engineering design that outperforms simpler two-agent workflows in managing complex design tasks.

Findings

01

Multi-agent system produces more detailed design graphs.

02

Reasoning-distilled LLM improves workflow completion.

03

Requirement coverage remains low across systems.

Abstract

Early-stage engineering design involves complex, iterative reasoning, yet existing large language model (LLM) workflows struggle to maintain task continuity and generate executable models. We evaluate whether a structured multi-agent system (MAS) can more effectively manage requirements extraction, functional decomposition, and simulator code generation than a simpler two-agent system (2AS). The target application is a solar-powered water filtration system as described in a cahier des charges. We introduce the Design-State Graph (DSG), a JSON-serializable representation that bundles requirements, physical embodiments, and Python-based physics models into graph nodes. A nine-role MAS iteratively builds and refines the DSG, while the 2AS collapses the process to a Generator-Reflector loop. Both systems run a total of 60 experiments (2 LLMs - Llama 3.3 70B vs reasoning-distilled DeepSeek…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.