ForCenNet: Foreground-Centric Network for Document Image Rectification

Peng Cai; Qiang Li; Kaicheng Yang; Dong Guo; Jia Li; Nan Zhou; Xiang An; Ninghua Yang; Jiankang Deng

arXiv:2507.19804·cs.CV·July 29, 2025

ForCenNet: Foreground-Centric Network for Document Image Rectification

Peng Cai, Qiang Li, Kaicheng Yang, Dong Guo, Jia Li, Nan Zhou, Xiang An, Ninghua Yang, Jiankang Deng

PDF

TL;DR

ForCenNet introduces a foreground-centric approach to document image rectification, emphasizing foreground elements to improve geometric correction and achieve state-of-the-art results on multiple benchmarks.

Contribution

The paper presents a novel foreground-centric label generation, mask mechanism, and curvature loss to enhance document rectification accuracy.

Findings

01

Achieves new state-of-the-art on four benchmarks.

02

Effectively undistorts text lines and table borders.

03

Outperforms existing methods in geometric correction.

Abstract

Document image rectification aims to eliminate geometric deformation in photographed documents to facilitate text recognition. However, existing methods often neglect the significance of foreground elements, which provide essential geometric references and layout information for document image correction. In this paper, we introduce Foreground-Centric Network (ForCenNet) to eliminate geometric distortions in document images. Specifically, we initially propose a foreground-centric label generation method, which extracts detailed foreground elements from an undistorted image. Then we introduce a foreground-centric mask mechanism to enhance the distinction between readable and background regions. Furthermore, we design a curvature consistency loss to leverage the detailed foreground labels to help the model understand the distorted geometric distribution. Extensive experiments demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.