NovaLAD: A Fast, CPU-Optimized Document Extraction Pipeline for Generative AI and Data Intelligence
Aman Ulla

TL;DR
NovaLAD is a CPU-optimized document extraction system that combines object detection, layout analysis, and vision-language models to efficiently convert unstructured documents into structured, multi-format outputs for AI applications.
Contribution
It introduces a novel, fast, CPU-based pipeline integrating YOLO detection, rule-based grouping, and vision-language models for comprehensive document parsing without GPU reliance.
Findings
Achieves 96.49% TEDS and 98.51% NID on DP-Bench
Outperforms commercial and open-source parsers in accuracy
Operates efficiently on CPU with parallel processing
Abstract
Document extraction is an important step before retrieval-augmented generation (RAG), knowledge bases, and downstream generative AI can work. It turns unstructured documents like PDFs and scans into structured text and layout-aware representations. We introduce NovaLAD, a comprehensive document parsing system that integrates two concurrent YOLO object detection models - element detection and layout detection - with rule-based grouping and optional vision-language enhancement. When a page image is sent in, the first thing that happens is that it goes through both models at the same time. The element model finds semantic content like the title, header, text, table, image, and so on, and the layout model finds structural regions like layout_box, column_group, multi_column, row_group, and so on. A key design decision is to first send an image or figure through an image classifier (ViT) that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Advanced Neural Network Applications · Image Retrieval and Classification Techniques
