# Boundary-aware dual-discriminator generative adversarial network for data augmentation in financial transaction fraud detection

**Authors:** Honghao Zhu, Zhanchao Wang, Yu Xie, Jiamin Yao

PMC · DOI: 10.1371/journal.pone.0342095 · PLOS One · 2026-02-20

## TL;DR

This paper introduces BADGAN, a new GAN-based method for improving fraud detection by generating realistic synthetic fraud data.

## Contribution

BADGAN introduces a boundary-aware dual-discriminator mechanism to enhance synthetic fraud sample quality near decision boundaries.

## Key findings

- BADGAN outperforms existing methods in handling class imbalance in financial fraud detection.
- Synthetic samples generated by BADGAN improve the downstream classifier's ability to detect fraud.
- Experiments on real-world and public datasets validate BADGAN's effectiveness.

## Abstract

The rapid growth of digital payments exacerbates the challenges in Financial Transaction Fraud Detection (FTFD). These challenges stem primarily from an extreme class imbalance, where legitimate transactions greatly outnumber fraudulent ones. This imbalance significantly hampers the ability of FTFD models to accurately learn fraud patterns. Although existing data augmentation techniques have shown effectiveness in alleviating this problem, they are often negatively influenced by anomalous samples that diverge from the true fraud distribution due to fraudsters’ concealment strategies and the inherent complexity of fraudulent patterns. This divergence makes it challenging to accurately model the distribution of fraudulent activities. In this work, we propose a Boundary-Aware Dual-discriminator Generative Adversarial Network (BADGAN) to address the class imbalance issue in FTFD. BADGAN integrates a boundary sample classifier with a dual-constraint mechanism based on distance adversarial learning, allowing the generator to produce synthetic samples that both adhere to the distribution of real fraud data and maintain a distance from the decision boundary. This boundary-aware design emphasizes the optimization of sample quality near classification boundaries, thereby improving the downstream classifier’s ability to distinguish fraudulent behavior. Extensive experiments on both real-world and public datasets demonstrate that BADGAN outperforms its competitive peers in addressing the class imbalance issue, thereby enhancing the detection performance of FTFD models.

## Full-text entities

- **Genes:** CSRP3 (cysteine and glycine rich protein 3) [NCBI Gene 8048] {aka CLP, CMD1M, CMH12, CRP3, MLP}
- **Diseases:** medical anomaly (MESH:D000069279), COVID-19 (MESH:D000086382), poisoning (MESH:D011041)
- **Chemicals:** BADGAN (-)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12923028/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12923028/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/PMC12923028/full.md

---
Source: https://tomesphere.com/paper/PMC12923028