Predicting and Explaining Customer Data Sharing in the Open Banking
Jo\~ao B. G. de Brito, Rodrigo Heldt, Cleo S. Silveira, Matthias Bogaert, Guilherme B. Bucco, Fernando B. Luce, Jo\~ao L. Becker, Filipe J. Zabala, Michel J. Anzanello

TL;DR
This paper presents a predictive framework using machine learning to determine customer data sharing behavior in Open Banking, along with interpretability analysis revealing key influencing factors.
Contribution
It introduces a hybrid data balancing approach and combines XGBoost with EMA for accurate prediction and explanation of customer data sharing in Open Banking.
Findings
Achieved over 91% accuracy in predicting data inflow and outflow.
Identified mobile transactions and credit usage as key factors influencing sharing behavior.
Demonstrated the effectiveness of combining SHAP and CART for interpretability.
Abstract
The emergence of Open Banking represents a significant shift in financial data management, influencing financial institutions' market dynamics and marketing strategies. This increased competition creates opportunities and challenges, as institutions manage data inflow to improve products and services while mitigating data outflow that could aid competitors. This study introduces a framework to predict customers' propensity to share data via Open Banking and interprets this behavior through Explanatory Model Analysis (EMA). Using data from a large Brazilian financial institution with approximately 3.2 million customers, a hybrid data balancing strategy incorporating ADASYN and NEARMISS techniques was employed to address the infrequency of data sharing and enhance the training of XGBoost models. These models accurately predicted customer data sharing, achieving 91.39% accuracy for inflow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
