Exploring Generative Process Reward Modeling for Semi-Structured Data: A Case Study of Table Question Answering

Lei Tang; Wei Zhou; Mohsen Mesgar

arXiv:2510.20304·cs.CL·January 26, 2026

Exploring Generative Process Reward Modeling for Semi-Structured Data: A Case Study of Table Question Answering

Lei Tang, Wei Zhou, Mohsen Mesgar

PDF

Open Access 1 Video

TL;DR

This paper investigates the use of process reward models for semi-structured data, specifically table question answering, revealing their potential and current limitations in this domain.

Contribution

First systematic study of process reward models applied to table question answering, analyzing their effectiveness and challenges in semi-structured data tasks.

Findings

01

PRMs with textual and code verification aid answer selection

02

PRMs struggle to generalize to out-of-domain data

03

Weak correlation between step verification and answer accuracy

Abstract

Process reward models (PRMs) enhance complex reasoning in large language models (LLMs) by evaluating candidate solutions step-by-step and selecting answers based on aggregated step scores. While effective in domains such as mathematics, their applicability to tasks involving semi-structured data, like table question answering (TQA), remains unexplored. TQA poses unique challenges for PRMs, including abundant irrelevant information, loosely connected reasoning steps, and domain-specific reasoning. This work presents the first systematic study of PRMs for TQA. We evaluate state-of-the-art generative PRMs on TQA from both answer and step perspectives. Results show that PRMs that combine textual and code verification can aid solution selection but struggle to generalize to out-of-domain data. Analysis reveals a weak correlation between performance in step-level verification and answer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Exploring Generative Process Reward Modeling for Semi-Structured Data: A Case Study of Table Question Answering· underline

Taxonomy

TopicsTopic Modeling · Intelligent Tutoring Systems and Adaptive Learning · Advanced Graph Neural Networks