Beyond Rows to Reasoning: Agentic Retrieval for Multimodal Spreadsheet Understanding and Editing
Anmol Gulati, Sahil Sen, Waqar Sarguroh, Kevin Paul

TL;DR
BRTR is an iterative, agentic framework for multimodal spreadsheet understanding that supports end-to-end workflows from analysis to structured editing and reports state-of-the-art results on three benchmarks.
Contribution
BRTR replaces single-pass retrieval with an iterative tool-calling loop, enabling reliable multi-step reasoning and structured editing in complex enterprise spreadsheets.
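To make the contrast with single-pass retrieval concrete, here is a minimal sketch of an iterative tool-calling loop over a toy workbook. It is not BRTR's implementation: the workbook data, the tool names (`search_cells`, `read_cell`), the rule-based `scripted_planner` standing in for an LLM, and the `run_agent` driver are all assumptions made for illustration.

```python
from typing import Callable

# Toy workbook (illustrative data only): sheet name -> {cell address -> value}.
WORKBOOK = {
    "Summary": {"A1": "Total revenue", "B1": "=Sales!B4"},
    "Sales": {"A1": "Region", "B1": "Revenue",
              "A2": "North", "B2": 120,
              "A3": "South", "B3": 80,
              "A4": "Total", "B4": 200},
}

def search_cells(query: str) -> list[tuple[str, str, object]]:
    """Hypothetical retrieval tool: case-insensitive substring match on text cells."""
    q = query.lower()
    return [(sheet, addr, val)
            for sheet, cells in WORKBOOK.items()
            for addr, val in cells.items()
            if isinstance(val, str) and q in val.lower()]

def read_cell(sheet: str, addr: str) -> object:
    """Hypothetical tool: return the raw content of a single cell."""
    return WORKBOOK[sheet][addr]

TOOLS: dict[str, Callable] = {"search_cells": search_cells, "read_cell": read_cell}

def scripted_planner(question: str, history: list[dict]) -> tuple:
    """Rule-based stand-in for an LLM planner: choose the next action from the trace."""
    if not history:
        return ("call", "search_cells", (question,))
    last = history[-1]["result"]
    if isinstance(last, list):
        if not last:
            return ("answer", None)
        sheet, addr, _ = last[0]
        # Assume the value sits in column B of the row where the label was found.
        return ("call", "read_cell", (sheet, "B" + addr[1:]))
    if isinstance(last, str) and last.startswith("="):
        # Follow a simple cross-sheet reference such as "=Sales!B4".
        ref_sheet, ref_addr = last[1:].split("!")
        return ("call", "read_cell", (ref_sheet, ref_addr))
    return ("answer", last)

def run_agent(question: str, planner=scripted_planner, max_steps: int = 5):
    """Call tools until the planner answers or the step budget runs out."""
    history: list[dict] = []
    for _ in range(max_steps):
        action = planner(question, history)
        if action[0] == "answer":
            return action[1], history
        _, tool, args = action
        history.append({"tool": tool, "args": args, "result": TOOLS[tool](*args)})
    return None, history

answer, trace = run_agent("Total revenue")
print(answer, [step["tool"] for step in trace])
```

In this toy case a single `search_cells` call returns only the label cell on the Summary sheet, while the loop goes on to read the adjacent formula and follow its cross-sheet reference to the underlying value. A real planner would be a model choosing among tools rather than a fixed set of rules.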
Findings
Achieves state-of-the-art performance on three benchmarks
Identifies NVIDIA NeMo Retriever 1B as the top multimodal embedding model
Confirms the importance of the planner, retrieval, and iterative reasoning
Abstract
Recent advances in multimodal Retrieval-Augmented Generation (RAG) enable Large Language Models (LLMs) to analyze enterprise spreadsheet workbooks containing millions of cells, cross-sheet dependencies, and embedded visual artifacts. However, state-of-the-art approaches exclude critical context through single-pass retrieval, lose data resolution through compression, and exceed LLM context windows through naive full-context injection, preventing reliable multi-step reasoning over complex enterprise workbooks. We introduce Beyond Rows to Reasoning (BRTR), a multimodal agentic framework for spreadsheet understanding that replaces single-pass retrieval with an iterative tool-calling loop, supporting end-to-end Excel workflows from complex analysis to structured editing. Supported by over 200 hours of expert human evaluation, BRTR achieves state-of-the-art performance across three frontier…
Taxonomy
Topics: Spreadsheets and End-User Computing · Information Retrieval and Search Behavior · Data Visualization and Analytics
