Efficient Densest Flow Queries in Transaction Flow Networks (Complete Version)
Jiaxin Jiang, Yunxiang Zhao, Lyu Xu, Byron Choi, Bingsheng He, Shixuan Sun, Jia Chen

TL;DR
This paper introduces an efficient algorithm for identifying densest flow subgraphs in transaction networks, crucial for fraud detection, with industry validation demonstrating significant performance improvements.
Contribution
It proposes the \\(S\\-\\)\\(T\\) densest flow query, proves its NP-hardness, and develops the CONAN divide-and-conquer algorithm with an approximate flow-peeling method for scalable fraud detection.
Findings
CONAN outperforms baseline methods by up to 1000x in runtime.
The approach effectively detects fraudulent dense flows in real-world transaction networks.
Applied in industry, it significantly improves fraud detection accuracy.
Abstract
Transaction flow networks are crucial in detecting illicit activities such as wash trading, credit card fraud, cashback arbitrage fraud, and money laundering. \revise{Our collaborator, Grab, a leader in digital payments in Southeast Asia, faces increasingly sophisticated fraud patterns in its transaction flow networks. In industry settings such as Grab's fraud detection pipeline, identifying fraudulent activities heavily relies on detecting dense flows within transaction networks. Motivated by this practical foundation,} we propose the \emph{\(S\)-\(T\) densest flow} (\SDMF{}) query. Given a transaction flow network \( G \), a source set \( \Src \), a sink set \( \Dst \), and a size threshold \( k \), the query outputs subsets \( \Src' \subseteq \Src \) and \( \Dst' \subseteq \Dst \) such that the maximum flow from \( \Src' \) to \( \Dst' \) is densest, with \(|\Src' \cup \Dst'| \geq…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Data Mining Algorithms and Applications · Advanced Database Systems and Queries
