Human-In-The-Loop Document Layout Analysis
Xingjiao Wu, Tianlong Ma, Xin Li, Qin Chen, Liang He

TL;DR
This paper introduces a human-in-the-loop approach for document layout analysis that efficiently selects key samples for active labeling, significantly improving model performance with minimal labeled data.
Contribution
It proposes the Key Samples Selection (KSS) method and a reinforcement learning-inspired update strategy to enhance DLA accuracy while reducing labeling costs.
Findings
Achieved 86.3% accuracy on DSSE-200 with 10% labeled data.
Achieved 95.6% accuracy on CS-150 with 10% labeled data.
Reduced labeling effort while maintaining high performance.
Abstract
Document layout analysis (DLA) aims to divide a document image into different types of regions. DLA plays an important role in the document content understanding and information extraction systems. Exploring a method that can use less data for effective training contributes to the development of DLA. We consider a Human-in-the-loop (HITL) collaborative intelligence in the DLA. Our approach was inspired by the fact that the HITL push the model to learn from the unknown problems by adding a small amount of data based on knowledge. The HITL select key samples by using confidence. However, using confidence to find key samples is not suitable for DLA tasks. We propose the Key Samples Selection (KSS) method to find key samples in high-level tasks (semantic segmentation) more accurately through agent collaboration, effectively reducing costs. Once selected, these key samples are passed to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Advanced Data Storage Technologies · Scientific Computing and Data Management
MethodsDeep Layer Aggregation
