Open Banking Foundational Model: Learning Language Representations from Few Financial Transactions

Gustavo Polleti; Marlesson Santana; Eduardo Fontes

arXiv:2511.12154·cs.LG·November 18, 2025

Open Banking Foundational Model: Learning Language Representations from Few Financial Transactions

Gustavo Polleti, Marlesson Santana, Eduardo Fontes

PDF

Open Access

TL;DR

This paper presents a multimodal foundational model for financial transactions that combines structured and unstructured data, improving performance especially in data-scarce Open Banking scenarios and demonstrating cross-institutional generalization.

Contribution

It introduces a novel multimodal, self-supervised model for financial transaction representation, pioneering large-scale cross-institutional analysis in North America.

Findings

01

Outperforms classical feature engineering methods

02

Effective in data-scarce Open Banking scenarios

03

Generalizes across different financial institutions

Abstract

We introduced a multimodal foundational model for financial transactions that integrates both structured attributes and unstructured textual descriptions into a unified representation. By adapting masked language modeling to transaction sequences, we demonstrated that our approach not only outperforms classical feature engineering and discrete event sequence methods but is also particularly effective in data-scarce Open Banking scenarios. To our knowledge, this is the first large-scale study across thousands of financial institutions in North America, providing evidence that multimodal representations can generalize across geographies and institutions. These results highlight the potential of self-supervised models to advance financial applications ranging from fraud prevention and credit risk to customer insights

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFinancial Distress and Bankruptcy Prediction · Imbalanced Data Classification Techniques · Stock Market Forecasting Methods