Starlit: Privacy-Preserving Federated Learning to Enhance Financial   Fraud Detection

Aydin Abadi; Bradley Doyle; Francesco Gini; Kieron Guinamard; Sasi; Kumar Murakonda; Jack Liddell; Paul Mellor; Steven J. Murdoch; Mohammad; Naseri; Hector Page; George Theodorakopoulos; Suzanne Weller

arXiv:2401.10765·cs.LG·January 23, 2024·6 cites

Starlit: Privacy-Preserving Federated Learning to Enhance Financial Fraud Detection

Aydin Abadi, Bradley Doyle, Francesco Gini, Kieron Guinamard, Sasi, Kumar Murakonda, Jack Liddell, Paul Mellor, Steven J. Murdoch, Mohammad, Naseri, Hector Page, George Theodorakopoulos, Suzanne Weller

PDF

Open Access

TL;DR

Starlit is a scalable, privacy-preserving federated learning framework designed to improve financial fraud detection by addressing security, scalability, and dropout resistance issues present in prior solutions.

Contribution

This paper introduces Starlit, a novel federated learning mechanism that overcomes key limitations of existing methods, including security proofs, scalability, and client dropout handling.

Findings

01

Starlit demonstrates high scalability and efficiency in synthetic financial data tests.

02

The framework achieves accurate fraud detection without compromising privacy.

03

Starlit's design effectively handles client dropouts and security concerns.

Abstract

Federated Learning (FL) is a data-minimization approach enabling collaborative model training across diverse clients with local data, avoiding direct data exchange. However, state-of-the-art FL solutions to identify fraudulent financial transactions exhibit a subset of the following limitations. They (1) lack a formal security definition and proof, (2) assume prior freezing of suspicious customers' accounts by financial institutions (limiting the solutions' adoption), (3) scale poorly, involving either $O (n^{2})$ computationally expensive modular exponentiation (where $n$ is the total number of financial institutions) or highly inefficient fully homomorphic encryption, (4) assume the parties have already completed the identity alignment phase, hence excluding it from the implementation, performance evaluation, and security analysis, and (5) struggle to resist clients' dropouts. This work…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Imbalanced Data Classification Techniques · Machine Learning in Healthcare