Evaluating Transfer Learning Methods on Real-World Data Streams: A Case Study in Financial Fraud Detection
Ricardo Ribeiro Pereira, Jacopo Bono, Hugo Ferreira, Pedro Ribeiro, Carlos Soares, Pedro Bizarro

TL;DR
This paper introduces a framework for evaluating transfer learning methods in realistic, dynamic data scenarios, particularly in financial fraud detection, addressing the limitations of static evaluation assumptions.
Contribution
The authors propose a novel data manipulation framework that simulates varying data availability, domain variability, and concept shifts for more realistic transfer learning evaluation.
Findings
Framework enables simulation of realistic data scenarios over time
Case study demonstrates framework's applicability to real-world financial data
Framework improves understanding of transfer learning behavior in dynamic environments
Abstract
When the available data for a target domain is limited, transfer learning (TL) methods can be used to develop models on related data-rich domains, before deploying them on the target domain. However, these TL methods are typically designed with specific, static assumptions on the amount of available labeled and unlabeled target data. This is in contrast with many real world applications, where the availability of data and corresponding labels varies over time. Since the evaluation of the TL methods is typically also performed under the same static data availability assumptions, this would lead to unrealistic expectations concerning their performance in real world settings. To support a more realistic evaluation and comparison of TL algorithms and models, we propose a data manipulation framework that (1) simulates varying data availability scenarios over time, (2) creates multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
