Corporate Fraud Detection in Rich-yet-Noisy Financial Graph
Shiqi Wang, Zhibo Zhang, Libing Fang, Cam-Tu Nguyen, Wenzhong Li

TL;DR
This paper introduces a novel graph-based method, KeGCN_R, for corporate fraud detection that effectively handles noisy financial graphs and hidden frauds, demonstrating superior performance and robustness.
Contribution
The paper proposes KeGCN_R, a knowledge-enhanced GCN with a two-stage learning process, to improve fraud detection in noisy, complex financial graphs.
Findings
KeGCN_R outperforms baseline models in detection accuracy.
Knowledge graph embeddings mitigate information overload.
Two-stage learning enhances robustness against hidden frauds.
Abstract
Corporate fraud detection aims to automatically recognize companies that conduct wrongful activities such as fraudulent financial statements or illegal insider trading. Previous learning-based methods fail to effectively integrate rich interactions in the company network. To close this gap, we collect 18-year financial records in China to form three graph datasets with fraud labels. We analyze the characteristics of the financial graphs, highlighting two pronounced issues: (1) information overload: the dominance of (noisy) non-company nodes over company nodes hinders the message-passing process in Graph Convolution Networks (GCN); and (2) hidden fraud: there exists a large percentage of possible undetected violations in the collected data. The hidden fraud problem will introduce noisy labels in the training dataset and compromise fraud detection results. To handle such challenges, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Advanced Graph Neural Networks · Financial Distress and Bankruptcy Prediction
MethodsGraph Convolutional Network · Convolution
