MADP: A Multi-Agent Pipeline for Sustainable Document Processing with Human-in-the-Loop

Diego Gosmar; Giovanni Zenezini

arXiv:2605.17159·cs.AI·May 19, 2026

MADP: A Multi-Agent Pipeline for Sustainable Document Processing with Human-in-the-Loop

Diego Gosmar, Giovanni Zenezini

PDF

TL;DR

MADP introduces a multi-agent system combining AI and human oversight for efficient, accurate, and sustainable enterprise document processing, significantly reducing manual effort and environmental impact.

Contribution

The paper presents MADP, a novel multi-agent architecture with a unique Prompt Fine Tuning with Feedback Inheritance method, enhancing automation and sustainability in enterprise document workflows.

Findings

01

Achieves 97.0% pipeline automation rate on real documents.

02

Reduces FTE requirements by approximately 70%.

03

Cuts CO2 emissions, energy, and water usage by around 69%.

Abstract

Document processing automation remains a critical challenge in enterprise environments, where traditional manual approaches are labor-intensive and error-prone. We present MADP, a multi-agent architecture that addresses the challenge of automating document processing in enterprise settings by combining deep learning-based classification and parsing with large language model extraction, while maintaining accuracy through selective human validation. Our system integrates five specialized agents--Classificator, Splitter, Parser, Extraction, and Validator--with a Human-in-the-Loop (HITL) mechanism and a novel Prompt Fine Tuning with Feedback Inheritance (PFTFI) approach. The operational analysis on a production use-case scenario of 100,000 invoices per year indicates a potential reduction of Full-Time Equivalent (FTE) requirements by approximately 70%. Production deployment on 955…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.