LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources
Haichao Ji, Zibo Wang, Cheng Pan, Meng Han, Yifei Zhu, Dan Wang, and Zhu Han

TL;DR
LAFA is a novel system that combines large language models with federated analytics to enable natural language queries over decentralized data sources while preserving privacy, improving efficiency and success rates.
Contribution
LAFA introduces a hierarchical multi-agent architecture that translates natural language queries into optimized federated analytics workflows, supporting privacy-preserving data analysis.
Findings
LAFA outperforms baseline prompting strategies in success rates.
LAFA reduces redundant FA operations, saving computational resources.
LAFA effectively transforms natural language queries into executable FA workflows.
Abstract
Large Language Models (LLMs) have shown great promise in automating data analytics tasks by interpreting natural language queries and generating multi-operation execution plans. However, existing LLM-agent-based analytics frameworks operate under the assumption of centralized data access, offering little to no privacy protection. In contrast, federated analytics (FA) enables privacy-preserving computation across distributed data sources, but lacks support for natural language input and requires structured, machine-readable queries. In this work, we present LAFA, the first system that integrates LLM-agent-based data analytics with FA. LAFA introduces a hierarchical multi-agent architecture that accepts natural language queries and transforms them into optimized, executable FA workflows. A coarse-grained planner first decomposes complex queries into sub-queries, while a fine-grained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Digital Economy · Cloud Computing and Resource Management · Privacy-Preserving Technologies in Data
