ProfiliTable: Profiling-Driven Tabular Data Processing via Agentic Workflows
Wei Liu, Yang Gu, Xi Yan, Zihan Nan, Beicheng Xu, Keyao Ding, Bin Cui, Wentao Zhang

TL;DR
ProfiliTable is an autonomous multi-agent framework that enhances tabular data processing by dynamic profiling, iterative refinement, and interactive exploration, improving robustness and accuracy in complex data transformation tasks.
Contribution
It introduces a novel multi-agent system with dynamic profiling and feedback-driven refinement for more reliable and semantically accurate table data processing.
Findings
Outperforms strong baselines on 18 tabular task types
Effective in complex multi-step data transformation scenarios
Dynamic profiling improves semantic understanding and robustness
Abstract
Table processing-including cleaning, transformation, augmentation, and matching-is a foundational yet error-prone stage in real-world data pipelines. While recent LLM-based approaches show promise for automating such tasks, they often struggle in practice due to ambiguous instructions, complex task structures, and the lack of structured feedback, resulting in syntactically correct but semantically flawed code. To address these challenges, we propose ProfiliTable, an autonomous multi-agent framework centered on dynamic profiling, which constructs and iteratively refines a unified execution context through interactive exploration, knowledge-augmented synthesis, and feedback-driven refinement. ProfiliTable integrates (i) a Profiler that performs ReAct-style data exploration to build semantic understanding, (ii) a Generator that retrieves curated operators to synthesize task-aware code, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
