DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models
Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu,, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto

TL;DR
This paper introduces DocKD, a knowledge distillation framework from LLMs that leverages external document knowledge to improve small visual document understanding models, especially on out-of-domain tasks.
Contribution
The paper proposes a novel DocKD framework that enriches data generation for knowledge distillation by integrating external document knowledge, enhancing model generalization.
Findings
DocKD produces high-quality document annotations.
Models trained with DocKD data outperform direct distillation methods.
Student models trained with DocKD data perform well on out-of-domain tasks.
Abstract
Visual document understanding (VDU) is a challenging task that involves understanding documents across various modalities (text and image) and layouts (forms, tables, etc.). This study aims to enhance generalizability of small VDU models by distilling knowledge from LLMs. We identify that directly prompting LLMs often fails to generate informative and useful data. In response, we present a new framework (called DocKD) that enriches the data generation process by integrating external document knowledge. Specifically, we provide an LLM with various document elements like key-value pairs, layouts, and descriptions, to elicit open-ended answers. Our experiments show that DocKD produces high-quality document annotations and surpasses the direct knowledge distillation approach that does not leverage external document knowledge. Moreover, student VDU models trained with solely DocKD-generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsKnowledge Distillation
