Temporal Contrastive Transformer for Financial Crime Detection: Self-Supervised Sequence Embeddings via Predictive Contrastive Coding
Danny Butvinik (NICE Actimize), Yonit Marcus (NICE Actimize), Nitzan Tal (NICE Actimize), Gabrielle Azoulay (NICE Actimize)

TL;DR
The paper presents TCT, a self-supervised transformer model that learns temporal embeddings from financial transaction sequences, showing promising predictive performance for fraud detection.
Contribution
Introduction of TCT, a contrastive learning framework for capturing temporal dynamics in financial sequences, reducing reliance on manual feature engineering.
Findings
Embeddings achieve AUC 0.8644 for fraud detection.
Combining embeddings with domain features yields no significant improvement.
Learned representations approximate existing domain-specific features.
Abstract
We introduce the Temporal Contrastive Transformer (TCT), a representation learning framework designed to capture contextual temporal dynamics in sequences of financial transactions. The model is trained using a self-supervised contrastive objective to produce embeddings that encode behavioral patterns over time, with the goal of supporting downstream fraud detection tasks. We evaluate TCT in a realistic setting by using the learned embeddings as input features to a gradient boosting classifier. Experimental results show that embeddings alone achieve meaningful predictive performance (AUC 0.8644), indicating that the model captures non-trivial temporal structure. However, when combined with domain-engineered features, no measurable improvement is observed over the baseline (AUC 0.9205 vs. 0.9245), suggesting that the learned representations largely overlap with existing feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
