Federated and Privacy-Preserving Learning of Accounting Data in   Financial Statement Audits

Marco Schreyer; Timur Sattarov; Damian Borth

arXiv:2208.12708·cs.LG·August 29, 2022·1 cites

Federated and Privacy-Preserving Learning of Accounting Data in Financial Statement Audits

Marco Schreyer, Timur Sattarov, Damian Borth

PDF

Open Access

TL;DR

This paper proposes a federated learning framework with privacy-preserving techniques for training deep learning models on sensitive accounting data, enabling auditors to detect anomalies across multiple clients without compromising data confidentiality.

Contribution

It introduces a novel federated learning approach incorporating differential privacy and split learning for privacy-preserving audit data analysis.

Findings

01

Effective anomaly detection in real-world datasets

02

Models benefit from multi-client data aggregation

03

Privacy measures mitigate data leakage risks

Abstract

The ongoing 'digital transformation' fundamentally changes audit evidence's nature, recording, and volume. Nowadays, the International Standards on Auditing (ISA) requires auditors to examine vast volumes of a financial statement's underlying digital accounting records. As a result, audit firms also 'digitize' their analytical capabilities and invest in Deep Learning (DL), a successful sub-discipline of Machine Learning. The application of DL offers the ability to learn specialized audit models from data of multiple clients, e.g., organizations operating in the same industry or jurisdiction. In general, regulations require auditors to adhere to strict data confidentiality measures. At the same time, recent intriguing discoveries showed that large-scale DL models are vulnerable to leaking sensitive training data information. Today, it often remains unclear how audit firms can apply DL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques