Agentic Large Language Models for Conceptual Systems Engineering and Design
Soheyl Massoudi, Mark Fuge

TL;DR
This study compares multi-agent and two-agent LLM workflows in early engineering design, demonstrating that structured multi-agent systems improve design detail and workflow management, though challenges remain in requirement coverage and code fidelity.
Contribution
The paper introduces a structured multi-agent system for engineering design that outperforms simpler two-agent workflows in managing complex design tasks.
Findings
Multi-agent system produces more detailed design graphs.
Reasoning-distilled LLM improves workflow completion.
Requirement coverage remains low across systems.
Abstract
Early-stage engineering design involves complex, iterative reasoning, yet existing large language model (LLM) workflows struggle to maintain task continuity and generate executable models. We evaluate whether a structured multi-agent system (MAS) can more effectively manage requirements extraction, functional decomposition, and simulator code generation than a simpler two-agent system (2AS). The target application is a solar-powered water filtration system as described in a cahier des charges. We introduce the Design-State Graph (DSG), a JSON-serializable representation that bundles requirements, physical embodiments, and Python-based physics models into graph nodes. A nine-role MAS iteratively builds and refines the DSG, while the 2AS collapses the process to a Generator-Reflector loop. Both systems run a total of 60 experiments (2 LLMs - Llama 3.3 70B vs reasoning-distilled DeepSeek…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
