FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding

Amit Agarwal; Srikant Panda; Kulbhushan Pachauri

arXiv:2505.17330·cs.CV·November 13, 2025

FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding

Amit Agarwal, Srikant Panda, Kulbhushan Pachauri

PDF

TL;DR

FS-DAG is a scalable, efficient few-shot model for visually rich document understanding that adapts to diverse document types with minimal data, handling OCR errors and domain shifts effectively.

Contribution

Introduces FS-DAG, a modular, low-parameter model architecture for VRDU that improves performance and robustness in few-shot settings compared to existing methods.

Findings

01

Significant improvements in convergence speed and accuracy.

02

Robustness to OCR errors and domain shifts.

03

Achieves high performance with less than 90M parameters.

Abstract

In this work, we propose Few Shot Domain Adapting Graph (FS-DAG), a scalable and efficient model architecture for visually rich document understanding (VRDU) in few-shot settings. FS-DAG leverages domain-specific and language/vision specific backbones within a modular framework to adapt to diverse document types with minimal data. The model is robust to practical challenges such as handling OCR errors, misspellings, and domain shifts, which are critical in real-world deployments. FS-DAG is highly performant with less than 90M parameters, making it well-suited for complex real-world applications for Information Extraction (IE) tasks where computational resources are limited. We demonstrate FS-DAG's capability through extensive experiments for information extraction task, showing significant improvements in convergence speed and performance compared to state-of-the-art methods.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.