DeepEye: A Steerable Self-driving Data Agent System
Boyan Li, Yiran Peng, Yupeng Xie, Sirong Lu, Yizhang Zhu, Xing Mu, Xinyu Liu, Yuyu Luo

TL;DR
DeepEye is a scalable, trustworthy data agent system that enables complex, multimodal data analysis workflows through a workflow-centric architecture and hierarchical reasoning.
Contribution
It introduces a workflow engine and multimodal orchestration protocol to address limitations of current LLM-based data agents in heterogeneous data analysis.
Findings
Successfully orchestrates complex multimodal workflows
Ensures structural correctness and optimized execution
Generates diverse outputs like dashboards and reports
Abstract
Large Language Models (LLMs) have revolutionized natural language interaction with data. The "holy grail" of data analytics is to build autonomous Data Agents that can self-drive complex data analysis workflows. However, current implementations are still limited to linear "ChatBI" systems. These systems struggle with joint analysis across heterogeneous data sources (e.g., databases, documents, and data files) and often encounter "context explosion" in complex and iterative data analysis workflows. To address these challenges, we present DeepEye, a production-ready data agent system that adopts a workflow-centric architecture to ensure scalability and trustworthiness. DeepEye introduces a Unified Multimodal Orchestration protocol, enabling seamless integration of structured and unstructured data sources. To mitigate hallucinations, it employs Hierarchical Reasoning with context…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
