MATATA: Weakly Supervised End-to-End MAthematical Tool-Augmented Reasoning for Tabular Applications
Vishnou Vinayagame, Gregory Senay, and Luis Mart\'i

TL;DR
MATATA is a weakly supervised, end-to-end reasoning framework that enhances small language models for understanding and reasoning over business documents with tabular data, achieving state-of-the-art results.
Contribution
It introduces a novel annotation-free training paradigm for multi-step reasoning agents, eliminating the need for intermediate supervision and enabling effective use of smaller language models.
Findings
Achieves state-of-the-art on FinQA and TAT-QA datasets.
Closely matches GPT-4 performance on TabMWP.
Demonstrates robust performance across multiple datasets.
Abstract
Business documents often contain substantial tabular and textual information with numerical values, requiring mathematical reasoning for effective document understanding. While Small Language Models (SLMs) still struggle at this task, tool-augmented multi-step agents perform better, at the cost of relying on closed-source or larger models, external data, or extensive prompt-engineering. This work introduces MATATA, a novel weakly supervised end-to-end approach to train multi-step reasoning language agents for document tabular applications. MATATA presents an annotation-free paradigm for each agent to enhance 3.8B/8B SLMs. During its two-stage training, MATATA uses the final outcome of the multi-step reasoning chain as weak supervision. This approach avoids having to individually supervise each intermediate agent in the reasoning chain. By employing an adaptive planner and shared tools…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Topic Modeling
MethodsByte Pair Encoding · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Residual Connection · Adam · Attention Is All You Need · Softmax · Label Smoothing · Dropout · Linear Layer
