ProUIE: A Macro-to-Micro Progressive Learning Method for LLM-based Universal Information Extraction

Wenda Liu; Zhigang Song; Shuai Nie; Guangyao Liu; Lisung Chen; Binyu Yang; Yaran Chen; Peng Zhou; Hongzhen Wang; Yuchen Liu; Wenyue Hu; Jiaming Xu; Runyu Shi; Ying Huang

arXiv:2604.10633·cs.CL·April 14, 2026

ProUIE: A Macro-to-Micro Progressive Learning Method for LLM-based Universal Information Extraction

Wenda Liu, Zhigang Song, Shuai Nie, Guangyao Liu, Lisung Chen, Binyu Yang, Yaran Chen, Peng Zhou, Hongzhen Wang, Yuchen Liu, Wenyue Hu, Jiaming Xu, Runyu Shi, Ying Huang

PDF

TL;DR

ProUIE introduces a three-stage macro-to-micro progressive learning framework for universal information extraction with LLMs, enhancing performance without external data across diverse datasets.

Contribution

It presents a novel macro-to-micro learning approach that improves UIE by sequentially modeling at different granularities without external information.

Findings

01

Outperforms instruction-tuned baselines on 36 datasets for NER and RE.

02

Achieves better results with a smaller backbone.

03

Shows significant gains in large-scale production-oriented IE.

Abstract

LLM-based universal information extraction (UIE) methods often rely on additional information beyond the original training data, which increases training complexity yet often yields limited gains. To address this, we propose ProUIE, a Macro-to-Micro progressive learning approach that improves UIE without introducing any external information. ProUIE consists of three stages: (i) macro-level Complete Modeling (CM), which learns NER, RE, and EE along their intrinsic difficulty order on the full training data to build a unified extraction foundation, (ii) meso-level Streamlined Alignment (SA), which operates on sampled data with simplified target formats, streamlining and regularizing structured outputs to make them more concise and controllable, and (iii) micro-level Deep Exploration (DE), which applies GRPO with stepwise fine-grained rewards (SFR) over structural units to guide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.