OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis
Sumit Shekhar, Bhanu Prakash Reddy Guda, Ashutosh Chaubey, Ishan, Jindal, Avneet Jain

TL;DR
OPAD is a reinforcement learning-based active learning framework that optimizes sample selection for complex document content detection tasks, reducing annotation costs and improving performance, especially in weakly labeled scenarios.
Contribution
The paper introduces OPAD, a novel reinforcement policy for active learning in document content detection, extending to weak labeling and incorporating class imbalance and user feedback rewards.
Findings
OPAD outperforms existing methods in document understanding tasks.
Incorporating user feedback improves annotation efficiency.
OPAD reduces annotation time significantly in experiments.
Abstract
Documents are central to many business systems, and include forms, reports, contracts, invoices or purchase orders. The information in documents is typically in natural language, but can be organized in various layouts and formats. There have been recent spurt of interest in understanding document content with novel deep learning architectures. However, document understanding tasks need dense information annotations, which are costly to scale and generalize. Several active learning techniques have been proposed to reduce the overall budget of annotation while maintaining the performance of the underlying deep learning model. However, most of these techniques work only for classification problems. But content detection is a more complex task, and has been scarcely explored in active learning literature. In this paper, we propose \textit{OPAD}, a novel framework using reinforcement policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Handwritten Text Recognition Techniques · Oil and Gas Production Techniques
