Statistical Detection of Collective Data Fraud

Ruoyu Wang (1; 2); Xiaobo Hu (3); Daniel Sun (2; 4; 5); Guoqiang; Li (1; 4); Raymond Wong (2); Shiping Chen (5); Jianquan Liu (6) ((1); Shanghai Jiao Tong University; (2) University of New South Wales; (3) Peking; University; (4) Enhitech Co. Ltd.; (5) Data61; CSIRO; (6) Biometrics Research; Laboratories; NEC Corporation)

arXiv:2001.00688·cs.DB·November 20, 2020

Statistical Detection of Collective Data Fraud

Ruoyu Wang (1, 2), Xiaobo Hu (3), Daniel Sun (2, 4, 5), Guoqiang, Li (1, 4), Raymond Wong (2), Shiping Chen (5), Jianquan Liu (6) ((1), Shanghai Jiao Tong University, (2) University of New South Wales, (3) Peking, University, (4) Enhitech Co. Ltd., (5) Data61, CSIRO

PDF

TL;DR

This paper introduces a statistical divergence-based method for detecting collective data fraud by analyzing distribution similarities among data collections, demonstrating its practical applicability in real-world scenarios.

Contribution

It proposes a novel collective detection technique that leverages statistical divergence to identify anomalies in data collections, extending beyond multimedia processing.

Findings

01

Effective in real-world data fraud detection scenarios

02

Utilizes distribution similarity for anomaly detection

03

Applicable across various data domains

Abstract

Statistical divergence is widely applied in multimedia processing, basically due to regularity and interpretable features displayed in data. However, in a broader range of data realm, these advantages may no longer be feasible, and therefore a more general approach is required. In data detection, statistical divergence can be used as a similarity measurement based on collective features. In this paper, we present a collective detection technique based on statistical divergence. The technique extracts distribution similarities among data collections, and then uses the statistical divergence to detect collective anomalies. Evaluation shows that it is applicable in the real world.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.