Starlit: Privacy-Preserving Federated Learning to Enhance Financial Fraud Detection
Aydin Abadi, Bradley Doyle, Francesco Gini, Kieron Guinamard, Sasi, Kumar Murakonda, Jack Liddell, Paul Mellor, Steven J. Murdoch, Mohammad, Naseri, Hector Page, George Theodorakopoulos, Suzanne Weller

TL;DR
Starlit is a scalable, privacy-preserving federated learning framework designed to improve financial fraud detection by addressing security, scalability, and dropout resistance issues present in prior solutions.
Contribution
This paper introduces Starlit, a novel federated learning mechanism that overcomes key limitations of existing methods, including security proofs, scalability, and client dropout handling.
Findings
Starlit demonstrates high scalability and efficiency in synthetic financial data tests.
The framework achieves accurate fraud detection without compromising privacy.
Starlit's design effectively handles client dropouts and security concerns.
Abstract
Federated Learning (FL) is a data-minimization approach enabling collaborative model training across diverse clients with local data, avoiding direct data exchange. However, state-of-the-art FL solutions to identify fraudulent financial transactions exhibit a subset of the following limitations. They (1) lack a formal security definition and proof, (2) assume prior freezing of suspicious customers' accounts by financial institutions (limiting the solutions' adoption), (3) scale poorly, involving either computationally expensive modular exponentiation (where is the total number of financial institutions) or highly inefficient fully homomorphic encryption, (4) assume the parties have already completed the identity alignment phase, hence excluding it from the implementation, performance evaluation, and security analysis, and (5) struggle to resist clients' dropouts. This work…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Imbalanced Data Classification Techniques · Machine Learning in Healthcare
