TabSieve: Explicit In-Table Evidence Selection for Tabular Prediction

Yongyao Wang; Ziqi Miao; Lu Yang; Haonan Jia; Wenting Yan; Chen Qian; Lijun Li

arXiv:2602.11700·cs.LG·February 13, 2026

TabSieve: Explicit In-Table Evidence Selection for Tabular Prediction

Yongyao Wang, Ziqi Miao, Lu Yang, Haonan Jia, Wenting Yan, Chen Qian, Lijun Li

PDF

Open Access

TL;DR

TabSieve is a framework that explicitly selects relevant table rows as evidence before predicting missing values, improving robustness and accuracy in tabular prediction tasks.

Contribution

It introduces a select-then-predict approach with a synthesized training dataset and reinforcement learning for joint evidence selection and prediction optimization.

Findings

01

Improves classification accuracy by 2.92% and regression by 4.45% over baselines.

02

Enhances robustness to noisy context by focusing on selected evidence.

03

Demonstrates consistent performance gains across various shot budgets.

Abstract

Tabular prediction can benefit from in-table rows as few-shot evidence, yet existing tabular models typically perform instance-wise inference and LLM-based prompting is often brittle. Models do not consistently leverage relevant rows, and noisy context can degrade performance. To address this challenge, we propose TabSieve, a select-then-predict framework that makes evidence usage explicit and auditable. Given a table and a query row, TabSieve first selects a small set of informative rows as evidence and then predicts the missing target conditioned on the selected evidence. To enable this capability, we construct TabSieve-SFT-40K by synthesizing high-quality reasoning trajectories from 331 real tables using a strong teacher model with strict filtering. Furthermore, we introduce TAB-GRPO, a reinforcement learning recipe that jointly optimizes evidence selection and prediction correctness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Imbalanced Data Classification Techniques · Explainable Artificial Intelligence (XAI)