Automated Population-Level Audit Assurance via AI-Based Document Intelligence

Santosh Vasudevan; Velu Natarajan

arXiv:2605.05252·cs.SE·May 8, 2026

Automated Population-Level Audit Assurance via AI-Based Document Intelligence

Santosh Vasudevan, Velu Natarajan

PDF

TL;DR

This paper introduces an AI-driven framework for automating large-scale audit transaction testing, replacing manual PDF reviews with scalable, real-time discrepancy detection using document intelligence.

Contribution

It presents a novel automated approach leveraging Snowflake Document AI for extracting and reconciling data from unstructured PDFs at scale.

Findings

01

Enabled population-level audit testing rather than sampling

02

Improved audit coverage and continuous assurance capabilities

03

Demonstrated scalable, near real-time risk identification

Abstract

Audit transaction testing validates accuracy and completeness of customer-facing statements against internal systems of record. Traditional manual, sample-based review of unstructured PDF statements is labor-intensive and does not scale to millions of transactions. This paper presents an automated framework for large-scale audit transaction testing using AI-based document intelligence. The solution leverages Snowflake Document AI to extract structured data from unstructured PDF statements using a small labeled corpus (approximately 20 documents). Extracted data are reconciled against authoritative source-of-truth datasets to identify discrepancies at scale. Results are surfaced through interactive dashboards and automated reports. The framework enables population-level testing rather than sampling-based approaches, improving audit coverage and supporting continuous assurance objectives.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.