Structuring the Unstructured: A Systematic Review of Text-to-Structure Generation for Agentic AI with a Universal Evaluation Framework

Zheye Deng; Chunkit Chan; Tianshi Zheng; Wei Fan; Weiqi Wang; Yangqiu Song

arXiv:2508.12257·cs.CL·August 19, 2025

Structuring the Unstructured: A Systematic Review of Text-to-Structure Generation for Agentic AI with a Universal Evaluation Framework

Zheye Deng, Chunkit Chan, Tianshi Zheng, Wei Fan, Weiqi Wang, Yangqiu Song

PDF

Open Access

TL;DR

This paper systematically reviews methods for converting unstructured text into structured formats, evaluates existing datasets and metrics, and proposes a universal evaluation framework to advance agentic AI capabilities.

Contribution

It provides a comprehensive synthesis of text-to-structure techniques, datasets, and evaluation metrics, and introduces a universal framework for assessing structured outputs in AI.

Findings

01

Current datasets lack diversity and standardization.

02

Evaluation metrics are inconsistent across studies.

03

The proposed framework enables consistent assessment of structured outputs.

Abstract

The evolution of AI systems toward agentic operation and context-aware retrieval necessitates transforming unstructured text into structured formats like tables, knowledge graphs, and charts. While such conversions enable critical applications from summarization to data mining, current research lacks a comprehensive synthesis of methodologies, datasets, and metrics. This systematic review examines text-to-structure techniques and the encountered challenges, evaluates current datasets and assessment criteria, and outlines potential directions for future research. We also introduce a universal evaluation framework for structured outputs, establishing text-to-structure as foundational infrastructure for next-generation AI systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Topic Modeling · Natural Language Processing Techniques