HiddenTables & PyQTax: A Cooperative Game and Dataset For TableQA to   Ensure Scale and Data Privacy Across a Myriad of Taxonomies

William Watson; Nicole Cho; Tucker Balch; Manuela Veloso

arXiv:2406.10803·cs.AI·June 18, 2024

HiddenTables & PyQTax: A Cooperative Game and Dataset For TableQA to Ensure Scale and Data Privacy Across a Myriad of Taxonomies

William Watson, Nicole Cho, Tucker Balch, Manuela Veloso

PDF

TL;DR

This paper introduces HiddenTables, a cooperative game framework for secure, scalable table question-answering with LLMs, and presents PyQTax, a large dataset for diverse question taxonomies, addressing challenges of data privacy, context limitations, and tokenization issues.

Contribution

The paper proposes HiddenTables as a novel cooperative game approach to enhance secure, scalable TableQA with LLMs and introduces PyQTax, a large dataset for diverse question types across numerous tables.

Findings

01

HiddenTables improves data security and scalability in TableQA tasks.

02

LLMs struggle with complex, compositional, and schema-specific queries.

03

PyQTax provides a comprehensive benchmark with over 116,000 question-answer pairs.

Abstract

A myriad of different Large Language Models (LLMs) face a common challenge in contextually analyzing table question-answering tasks. These challenges are engendered from (1) finite context windows for large tables, (2) multi-faceted discrepancies amongst tokenization patterns against cell boundaries, and (3) various limitations stemming from data confidentiality in the process of using external models such as gpt-3.5-turbo. We propose a cooperative game dubbed "HiddenTables" as a potential resolution to this challenge. In essence, "HiddenTables" is played between the code-generating LLM "Solver" and the "Oracle" which evaluates the ability of the LLM agents to solve Table QA tasks. This game is based on natural language schemas and importantly, ensures the security of the underlying data. We provide evidential experiments on a diverse set of tables that demonstrate an LLM's collective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training · ALIGN