UNIKIE-BENCH: Benchmarking Large Multimodal Models for Key Information Extraction in Visual Documents

Yifan Ji; Zhipeng Xu; Zhenghao Liu; Zulong Chen; Qian Zhang; Zhibo Yang; Junyang Lin; Yu Gu; Ge Yu; Maosong Sun

arXiv:2602.07038·cs.CV·April 27, 2026

UNIKIE-BENCH: Benchmarking Large Multimodal Models for Key Information Extraction in Visual Documents

Yifan Ji, Zhipeng Xu, Zhenghao Liu, Zulong Chen, Qian Zhang, Zhibo Yang, Junyang Lin, Yu Gu, Ge Yu, Maosong Sun

PDF

1 Repo

TL;DR

UNIKIE-BENCH provides a comprehensive benchmark for evaluating large multimodal models' ability to extract key information from diverse visual documents, revealing significant challenges and disparities in current model performance.

Contribution

It introduces a unified benchmark with two evaluation tracks for assessing LMMs on KIE tasks, highlighting persistent challenges and performance gaps.

Findings

01

Performance drops significantly with complex layouts and long-tail key fields.

02

Large disparities in model performance across document types and scenarios.

03

Grounding accuracy and layout reasoning remain challenging for LMMs.

Abstract

Key Information Extraction (KIE) from real-world documents remains challenging due to substantial variations in layout structures, visual quality, and task-specific information requirements. Recent Large Multimodal Models (LMMs) have shown promising potential for performing end-to-end KIE directly from document images. To enable a comprehensive and systematic evaluation across realistic and diverse application scenarios, we introduce UNIKIE-BENCH, a unified benchmark designed to rigorously evaluate the KIE capabilities of LMMs. UNIKIE-BENCH consists of two complementary tracks: a constrained-category KIE track with scenario-predefined schemas that reflect practical application needs, and an open-category KIE track that extracts any key information that is explicitly present in the document. Experiments on 15 state-of-the-art LMMs reveal substantial performance degradation under diverse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NEUIR/UNIKIE-BENCH
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.