Enhancing credit card fraud detection using DBSCAN-augmented disjunctive voting ensemble
Mahmoud A. Ghalwash, Samir Mohamed Abdelrazek, Nabila Hamid Eladawi, Haitham A. Ghalwash

TL;DR
This paper introduces a new method for credit card fraud detection that combines clustering and ensemble learning to better identify rare fraud cases.
Contribution
A novel hybrid framework using DBSCAN for data augmentation and disjunctive voting ensemble for high recall in fraud detection.
Findings
DBSCAN-based augmentation improves minority-class representation while preserving fraud patterns.
The DVE strategy achieves up to 99.5% recall and 99.8% F1-scores in fraud detection.
The framework outperforms traditional methods with 100% accuracy and precision.
Abstract
Credit card fraud detection remains a critical yet challenging task due to the extreme class imbalance inherent in transaction datasets, where fraudulent activities constitute only a small fraction of the total records. To address this imbalance and enhance the detection of rare fraud instances, this study proposes a novel hybrid framework that integrates density-based clustering for data augmentation with an ensemble classification strategy optimized for high recall. In the preprocessing stage, the framework employs density-based spatial clustering of applications with noise (DBSCAN) to identify minority-class clusters and synthetically augment the fraud class. This preserves the intrinsic structure of fraudulent patterns while increasing their representation in the training set. Subsequently, an ensemble model comprising random forest (RF), K-nearest neighbors (KNN), and support…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Financial Distress and Bankruptcy Prediction · Electricity Theft Detection Techniques
