Federated and Privacy-Preserving Learning of Accounting Data in Financial Statement Audits
Marco Schreyer, Timur Sattarov, Damian Borth

TL;DR
This paper proposes a federated learning framework with privacy-preserving techniques for training deep learning models on sensitive accounting data, enabling auditors to detect anomalies across multiple clients without compromising data confidentiality.
Contribution
It introduces a novel federated learning approach incorporating differential privacy and split learning for privacy-preserving audit data analysis.
Findings
Effective anomaly detection in real-world datasets
Models benefit from multi-client data aggregation
Privacy measures mitigate data leakage risks
Abstract
The ongoing 'digital transformation' fundamentally changes audit evidence's nature, recording, and volume. Nowadays, the International Standards on Auditing (ISA) requires auditors to examine vast volumes of a financial statement's underlying digital accounting records. As a result, audit firms also 'digitize' their analytical capabilities and invest in Deep Learning (DL), a successful sub-discipline of Machine Learning. The application of DL offers the ability to learn specialized audit models from data of multiple clients, e.g., organizations operating in the same industry or jurisdiction. In general, regulations require auditors to adhere to strict data confidentiality measures. At the same time, recent intriguing discoveries showed that large-scale DL models are vulnerable to leaking sensitive training data information. Today, it often remains unclear how audit firms can apply DL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques
