Statistical Detection of Collective Data Fraud
Ruoyu Wang (1, 2), Xiaobo Hu (3), Daniel Sun (2, 4, 5), Guoqiang, Li (1, 4), Raymond Wong (2), Shiping Chen (5), Jianquan Liu (6) ((1), Shanghai Jiao Tong University, (2) University of New South Wales, (3) Peking, University, (4) Enhitech Co. Ltd., (5) Data61, CSIRO

TL;DR
This paper introduces a statistical divergence-based method for detecting collective data fraud by analyzing distribution similarities among data collections, demonstrating its practical applicability in real-world scenarios.
Contribution
It proposes a novel collective detection technique that leverages statistical divergence to identify anomalies in data collections, extending beyond multimedia processing.
Findings
Effective in real-world data fraud detection scenarios
Utilizes distribution similarity for anomaly detection
Applicable across various data domains
Abstract
Statistical divergence is widely applied in multimedia processing, basically due to regularity and interpretable features displayed in data. However, in a broader range of data realm, these advantages may no longer be feasible, and therefore a more general approach is required. In data detection, statistical divergence can be used as a similarity measurement based on collective features. In this paper, we present a collective detection technique based on statistical divergence. The technique extracts distribution similarities among data collections, and then uses the statistical divergence to detect collective anomalies. Evaluation shows that it is applicable in the real world.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
