CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large   Language Models

Haoxiang Shi; Jiaan Wang; Jiarong Xu; Cen Wang; Tetsuya Sakai

arXiv:2405.12174·cs.CL·May 21, 2024

CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models

Haoxiang Shi, Jiaan Wang, Jiarong Xu, Cen Wang, Tetsuya Sakai

PDF

Open Access

TL;DR

This paper introduces CT-Eval, a comprehensive Chinese text-to-table dataset, to benchmark and enhance large language models' ability to generate structured tables from unstructured Chinese texts, addressing language limitations in existing datasets.

Contribution

The paper presents the first large-scale Chinese text-to-table dataset, CT-Eval, with data diversity and hallucination filtering, enabling effective evaluation and fine-tuning of LLMs for Chinese text-to-table tasks.

Findings

01

Zero-shot LLMs, including GPT-4, perform poorly compared to humans.

02

Fine-tuning open-source LLMs significantly improves their performance.

03

Open-source LLMs can outperform GPT-4 after fine-tuning.

Abstract

Text-to-Table aims to generate structured tables to convey the key information from unstructured documents. Existing text-to-table datasets are typically oriented English, limiting the research in non-English languages. Meanwhile, the emergence of large language models (LLMs) has shown great success as general task solvers in multi-lingual settings (e.g., ChatGPT), theoretically enabling text-to-table in other languages. In this paper, we propose a Chinese text-to-table dataset, CT-Eval, to benchmark LLMs on this task. Our preliminary analysis of English text-to-table datasets highlights two key factors for dataset construction: data diversity and data hallucination. Inspired by this, the CT-Eval dataset selects a popular Chinese multidisciplinary online encyclopedia as the source and covers 28 domains to ensure data diversity. To minimize data hallucination, we first train an LLM to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsAttention Is All You Need · Dense Connections · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding · Adam · Dropout