Exploring Robust Multi-Agent Workflows for Environmental Data Management
Boyuan Guan, Jason Liu, Yanzhao Wu, Kiavash Bahreini

TL;DR
This paper presents EnviSmart, a multi-agent system for environmental data management that enhances reliability and efficiency by externalizing knowledge and using role-separated agents to prevent irreversible errors.
Contribution
It introduces a novel multi-agent architecture with persistent artifacts and role separation to improve reliability and efficiency in environmental data workflows.
Findings
Multi-agent workflow completed in two days with artifact reuse.
Audited handoffs detected and prevented a coordinate transformation error.
Boundary-based containment achieved with 10-minute detection latency.
Abstract
Embedding LLM-driven agents into environmental FAIR data management is compelling - they can externalize operational knowledge and scale curation across heterogeneous data and evolving conventions. However, replacing deterministic components with probabilistic workflows changes the failure mode: LLM pipelines may generate plausible but incorrect outputs that pass superficial checks and propagate into irreversible actions such as DOI minting and public release. We introduce EnviSmart, a production data management system deployed on campus-wide storage infrastructure for environmental research. EnviSmart treats reliability as an architectural property through two mechanisms: a three-track knowledge architecture that externalizes behaviors (governance constraints), domain knowledge (retrievable context), and skills (tool-using procedures) as persistent, interlocking artifacts; and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
