Towards Robust Real-World Spreadsheet Understanding with Multi-Agent Multi-Format Reasoning

Houxing Ren; Mingjie Zhan; Zimu Lu; Ke Wang; Yunqiao Yang; Haotian Hou; Hongsheng Li

arXiv:2604.12282·cs.CL·April 15, 2026

Towards Robust Real-World Spreadsheet Understanding with Multi-Agent Multi-Format Reasoning

Houxing Ren, Mingjie Zhan, Zimu Lu, Ke Wang, Yunqiao Yang, Haotian Hou, Hongsheng Li

PDF

1 Repo

TL;DR

SpreadsheetAgent introduces a multi-agent, multi-modal framework that incrementally interprets and reasons over complex, large-scale spreadsheets, improving robustness and accuracy in real-world applications.

Contribution

The paper presents a novel two-stage multi-agent approach that leverages multiple modalities and verification to enhance spreadsheet understanding beyond existing LLM-based methods.

Findings

01

Achieves 38.16% on Spreadsheet Bench, surpassing baseline by 2.89 points.

02

Effectively interprets large, complex spreadsheets using incremental, localized reasoning.

03

Demonstrates robustness and scalability in real-world spreadsheet tasks.

Abstract

Spreadsheets are central to real-world applications such as enterprise reporting, auditing, and scientific data management. Despite their ubiquity, existing large language model based approaches typically treat tables as plain text, overlooking critical layout cues and visual semantics. Moreover, real-world spreadsheets are often massive in scale, exceeding the input length that LLMs can efficiently process. To address these challenges, we propose SpreadsheetAgent, a two-stage multi-agent framework for spreadsheet understanding that adopts a step-by-step reading and reasoning paradigm. Instead of loading the entire spreadsheet at once, SpreadsheetAgent incrementally interprets localized regions through multiple modalities, including code execution results, images, and LaTeX tables. The method first constructs a structural sketch and row/column summaries, and then performs task-driven…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

renhouxing/SpreadsheetAgent.git
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.