Agentar-Fin-OCR

Siyi Qian; Xiongfei Bai; Bingtao Fu; Yichen Lu; Gaoyang Zhang; Xudong Yang; Peng Zhang

arXiv:2603.11044·cs.CV·March 12, 2026

Agentar-Fin-OCR

Siyi Qian, Xiongfei Bai, Bingtao Fu, Yichen Lu, Gaoyang Zhang, Xudong Yang, Peng Zhang

PDF

Open Access

TL;DR

Agentar-Fin-OCR is a specialized document parsing system designed for financial PDFs, combining novel algorithms and training strategies to accurately extract structured data from complex, multi-page financial documents, validated on a new benchmark.

Contribution

The paper introduces Agentar-Fin-OCR, a comprehensive system with novel algorithms and a new benchmark, FinDocBench, tailored for high-accuracy financial document parsing.

Findings

01

High performance on table parsing metrics of OmniDocBench

02

Effective handling of complex layouts and cross-page structures

03

Robust evaluation of models on financial documents

Abstract

In this paper, we propose Agentar-Fin-OCR, a document parsing system tailored to financial-domain documents, transforming ultra-long financial PDFs into semantically consistent, highly accurate, structured outputs with auditing-grade provenance. To address finance-specific challenges such as complex layouts, cross-page structural discontinuities, and cell-level referencing capability, Agentar-Fin-OCR combines (1) a Cross-page Contents Consolidation algorithm to restore continuity across pages and a Document-level Heading Hierarchy Reconstruction (DHR) module to build a globally consistent Table of Contents (TOC) tree for structure-aware retrieval, and (2) a difficulty-adaptive curriculum learning training strategy for table parsing, together with a CellBBoxRegressor module that uses structural anchor tokens to localize table cells from decoder hidden states without external detectors.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Topic Modeling · Digital Humanities and Scholarship