MRT at SemEval-2025 Task 8: Maximizing Recovery from Tables with Multiple Steps

Maximiliano Hormaz\'abal Lagos,\'Alvaro Bueno Saez; H\'ector Cerezo-Costas; Pedro Alonso Doval; Jorge Alcalde Vesteiro

arXiv:2505.22264·cs.CL·May 29, 2025

MRT at SemEval-2025 Task 8: Maximizing Recovery from Tables with Multiple Steps

Maximiliano Hormaz\'abal Lagos,\'Alvaro Bueno Saez, H\'ector Cerezo-Costas, Pedro Alonso Doval, Jorge Alcalde Vesteiro

PDF

Open Access 1 Video

TL;DR

This paper presents a multi-step approach using Python code generation with LLMs to answer questions over tables, involving understanding, instruction generation, code translation, execution, and error handling, achieving 70.50% score.

Contribution

It introduces a novel multi-step method leveraging LLMs and fine-tuned prompts for table question-answering, enhancing accuracy and robustness.

Findings

01

Achieved 70.50% score on subtask 1.

02

Demonstrated effectiveness of multi-step LLM-based approach.

03

Utilized open source LLMs with optimized prompts.

Abstract

In this paper we expose our approach to solve the \textit{SemEval 2025 Task 8: Question-Answering over Tabular Data} challenge. Our strategy leverages Python code generation with LLMs to interact with the table and get the answer to the questions. The process is composed of multiple steps: understanding the content of the table, generating natural language instructions in the form of steps to follow in order to get the answer, translating these instructions to code, running it and handling potential errors or exceptions. These steps use open source LLMs and fine grained optimized prompts for each task (step). With this approach, we achieved a score of $70.50%$ for subtask 1.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MRT at SemEval-2025 Task 8: Maximizing Recovery from Tables with Multiple Steps· underline

Taxonomy

TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques