MADP: A Multi-Agent Pipeline for Sustainable Document Processing with Human-in-the-Loop
Diego Gosmar, Giovanni Zenezini

TL;DR
MADP introduces a multi-agent system combining AI and human oversight for efficient, accurate, and sustainable enterprise document processing, significantly reducing manual effort and environmental impact.
Contribution
The paper presents MADP, a novel multi-agent architecture with a unique Prompt Fine Tuning with Feedback Inheritance method, enhancing automation and sustainability in enterprise document workflows.
Findings
Achieves 97.0% pipeline automation rate on real documents.
Reduces FTE requirements by approximately 70%.
Cuts CO2 emissions, energy, and water usage by around 69%.
Abstract
Document processing automation remains a critical challenge in enterprise environments, where traditional manual approaches are labor-intensive and error-prone. We present MADP, a multi-agent architecture that addresses the challenge of automating document processing in enterprise settings by combining deep learning-based classification and parsing with large language model extraction, while maintaining accuracy through selective human validation. Our system integrates five specialized agents--Classificator, Splitter, Parser, Extraction, and Validator--with a Human-in-the-Loop (HITL) mechanism and a novel Prompt Fine Tuning with Feedback Inheritance (PFTFI) approach. The operational analysis on a production use-case scenario of 100,000 invoices per year indicates a potential reduction of Full-Time Equivalent (FTE) requirements by approximately 70%. Production deployment on 955…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
