Form2Seq : A Framework for Higher-Order Form Structure Extraction
Milan Aggarwal, Hiresh Gupta, Mausoom Sarkar, Balaji Krishnamurthy

TL;DR
Form2Seq introduces a novel text-based sequence-to-sequence framework for higher-order form structure extraction, outperforming segmentation methods by leveraging spatial and textual data to classify elements and group them into form structures.
Contribution
The paper presents a new Seq2Seq-based approach for form structure extraction that effectively uses spatial and textual information, improving accuracy over segmentation methods.
Findings
Achieved 90% accuracy on element classification.
Attained F1 scores of 75.82, 86.01, 61.63 on grouping tasks.
State-of-the-art results on ICDAR 2013 table recognition.
Abstract
Document structure extraction has been a widely researched area for decades with recent works performing it as a semantic segmentation task over document images using fully-convolution networks. Such methods are limited by image resolution due to which they fail to disambiguate structures in dense regions which appear commonly in forms. To mitigate this, we propose Form2Seq, a novel sequence-to-sequence (Seq2Seq) inspired framework for structure extraction using text, with a specific focus on forms, which leverages relative spatial arrangement of structures. We discuss two tasks; 1) Classification of low-level constituent elements (TextBlock and empty fillable Widget) into ten types such as field captions, list items, and others; 2) Grouping lower-level elements into higher-order constructs, such as Text Fields, ChoiceFields and ChoiceGroups, used as information collection mechanism in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Topic Modeling
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence
