PublicAgent: Multi-Agent Design Principles From an LLM-Based Open Data Analysis Framework
Sina Montazeri, Yunhe Feng, Kewei Sha

TL;DR
PublicAgent is a multi-agent framework leveraging specialized LLM agents to improve open data analysis workflows, addressing limitations of end-to-end models and enabling non-experts to access and analyze public datasets effectively.
Contribution
The paper introduces PublicAgent, a novel multi-agent architecture that decomposes complex data analysis tasks into specialized agents, demonstrating improved robustness and effectiveness across models and queries.
Findings
Specialization improves effectiveness regardless of model strength.
Universal and conditional agents have distinct effectiveness patterns.
Removing key agents causes significant failures.
Abstract
Open data repositories hold potential for evidence-based decision-making, yet are inaccessible to non-experts lacking expertise in dataset discovery, schema mapping, and statistical analysis. Large language models show promise for individual tasks, but end-to-end analytical workflows expose fundamental limitations: attention dilutes across growing contexts, specialized reasoning patterns interfere, and errors propagate undetected. We present PublicAgent, a multi-agent framework that addresses these limitations through decomposition into specialized agents for intent clarification, dataset discovery, analysis, and reporting. This architecture maintains focused attention within agent contexts and enables validation at each stage. Evaluation across five models and 50 queries derives five design principles for multi-agent LLM systems. First, specialization provides value independent of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Semantic Web and Ontologies · Research Data Management Practices
