Benchmarking Multi-Agent LLM Architectures for Financial Document Processing: A Comparative Study of Orchestration Patterns, Cost-Accuracy Tradeoffs and Production Scaling Strategies
Siddhant Kulkarni, Yukta Kulkarni

TL;DR
This study systematically benchmarks four multi-agent LLM architectures for financial document extraction, analyzing their accuracy, cost, and scalability to guide deployment decisions in regulated environments.
Contribution
It introduces a comprehensive empirical comparison of orchestration patterns, evaluates multiple LLMs on real financial data, and offers insights into cost-accuracy tradeoffs and scaling strategies.
Findings
Reflexive architectures achieve highest accuracy (F1 0.943) but at higher cost.
Hierarchical architectures offer the best cost-accuracy balance (F1 0.921 at 1.4x cost).
Hybrid configurations recover 89% of reflexive accuracy at lower costs.
Abstract
The adoption of large language models (LLMs) for structured information extraction from financial documents has accelerated rapidly, yet production deployments face fundamental architectural decisions with limited empirical guidance. We present a systematic benchmark comparing four multi-agent orchestration architectures: sequential pipeline, parallel fan-out with merge, hierarchical supervisor-worker and reflexive self-correcting loop. These are evaluated across five frontier and open-weight LLMs on a corpus of 10,000 SEC filings (10-K, 10-Q and 8-K forms). Our evaluation spans 25 extraction field types covering governance structures, executive compensation and financial metrics, measured along five axes: field-level F1, document-level accuracy, end-to-end latency, cost per document and token efficiency. We find that reflexive architectures achieve the highest field-level F1 (0.943)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Reporting and XBRL · Stock Market Forecasting Methods · Auditing, Earnings Management, Governance
