STAR: Semantic Table Representation with Header-Aware Clustering and Adaptive Weighted Fusion
Shui-Hsiang Hsu, Tsung-Hsiang Chou, Chen-Jui Yu, Yao-Chung Fan

TL;DR
STAR introduces a novel framework for semantic table representation that uses header-aware clustering and weighted fusion to improve table retrieval accuracy across multiple benchmarks.
Contribution
The paper presents STAR, a lightweight method that enhances semantic table representations through clustering and adaptive fusion, outperforming previous approaches like QGpT.
Findings
STAR achieves higher recall than QGpT on five benchmarks.
Semantic clustering improves the diversity of partial table samples.
Weighted fusion enhances fine-grained semantic alignment.
Abstract
Table retrieval is the task of retrieving the most relevant tables from large-scale corpora given natural language queries. However, structural and semantic discrepancies between unstructured text and structured tables make embedding alignment particularly challenging. Recent methods such as QGpT attempt to enrich table semantics by generating synthetic queries, yet they still rely on coarse partial-table sampling and simple fusion strategies, which limit semantic diversity and hinder effective query-table alignment. We propose STAR (Semantic Table Representation), a lightweight framework that improves semantic table representation through semantic clustering and weighted fusion. STAR first applies header-aware K-means clustering to group semantically similar rows and selects representative centroid instances to construct a diverse partial table. It then generates cluster-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Time Series Analysis and Forecasting · Handwritten Text Recognition Techniques
