LDI: Localized Data Imputation for Text-Rich Tables

Soroush Omidvartehrani; Davood Rafiei

arXiv:2506.16616·cs.DB·May 12, 2026

LDI: Localized Data Imputation for Text-Rich Tables

Soroush Omidvartehrani, Davood Rafiei

PDF

1 Repo

TL;DR

LDI introduces a localized reasoning framework using LLMs for imputing missing values in text-rich tables, enhancing accuracy, scalability, and interpretability over existing methods.

Contribution

The paper presents a novel localized reasoning approach with targeted attribute and tuple selection, improving imputation accuracy and transparency in text-rich tables.

Findings

01

LDI achieves up to 8% higher accuracy than state-of-the-art methods.

02

Localized reasoning reduces noise and improves scalability.

03

LDI offers transparent attribution for each imputed value.

Abstract

Missing values are pervasive in real-world tabular data and can significantly impair downstream analysis. Imputing them is especially challenging in text-rich tables, where dependencies are implicit, complex, and dispersed across long textual fields. Recent work has explored using Large Language Models (LLMs) for data imputation, yet existing approaches typically process entire tables or loosely related contexts, which can compromise accuracy, scalability, and explainability. We introduce LDI, a novel framework that leverages LLMs through localized reasoning, selecting a compact, contextually relevant subset of attributes and tuples for each missing value. This targeted selection reduces noise, improves scalability, and provides transparent attribution by revealing the dependency relations that justify each selected attribute and the evidence behind each retrieved tuple. It makes clear…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

soroushomidvar/LDI
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.