DataTales: A Benchmark for Real-World Intelligent Data Narration

Yajing Yang; Qian Liu; Min-Yen Kan

arXiv:2410.17859·cs.AI·August 26, 2025

DataTales: A Benchmark for Real-World Intelligent Data Narration

Yajing Yang, Qian Liu, Min-Yen Kan

PDF

Open Access 1 Repo

TL;DR

DataTales is a new benchmark that evaluates language models on their ability to generate clear, analytical narratives from complex financial data, addressing a gap in existing evaluation tools.

Contribution

It introduces a large, real-world dataset and benchmark for assessing data narration capabilities of language models in financial contexts.

Findings

01

Language models struggle with precision and analytical depth in data narration.

02

The benchmark reveals significant challenges faced by current models.

03

Provides a foundation for future model improvements and evaluation methods.

Abstract

We introduce DataTales, a novel benchmark designed to assess the proficiency of language models in data narration, a task crucial for transforming complex tabular data into accessible narratives. Existing benchmarks often fall short in capturing the requisite analytical complexity for practical applications. DataTales addresses this gap by offering 4.9k financial reports paired with corresponding market data, showcasing the demand for models to create clear narratives and analyze large datasets while understanding specialized terminology in the field. Our findings highlights the significant challenge that language models face in achieving the necessary precision and analytical depth for proficient data narration, suggesting promising avenues for future model development and evaluation methodologies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yajingyang/datatales
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Data Visualization and Analytics