Scalable Approximate Biclique Counting over Large Bipartite Graphs

Jingbang Chen; Weinuo Li; Yingli Zhou; Hangrui Zhou; Qiuyang Mang; Can Wang; Yixiang Fang; and Chenhao Ma

arXiv:2505.10471·cs.SI·December 9, 2025

Scalable Approximate Biclique Counting over Large Bipartite Graphs

Jingbang Chen, Weinuo Li, Yingli Zhou, Hangrui Zhou, Qiuyang Mang, Can Wang, Yixiang Fang, and Chenhao Ma

PDF

Open Access

TL;DR

This paper introduces a scalable approximate method for counting $(p,q)$-bicliques in large bipartite graphs, using novel graph structures and sampling techniques to achieve high accuracy and efficiency.

Contribution

The authors propose a new $(p,q)$-broom structure and sampling algorithm that provide unbiased estimates with error guarantees for approximate biclique counting.

Findings

01

Outperforms existing methods in accuracy, reducing error by up to 8 times.

02

Achieves significant runtime speedup, up to 50 times faster.

03

Effective on nine real-world bipartite networks, demonstrating scalability.

Abstract

Counting $(p, q)$ -bicliques in bipartite graphs is crucial for a variety of applications, from recommendation systems to cohesive subgraph analysis. Yet, it remains computationally challenging due to the combinatorial explosion to exactly count the $(p, q)$ -bicliques. In many scenarios, e.g., graph kernel methods, however, exact counts are not strictly required. To design a scalable and high-quality approximate solution, we novelly resort to $(p, q)$ -broom, a special spanning tree of the $(p, q)$ -biclique, which can be counted via graph coloring and efficient dynamic programming. Based on the intermediate results of the dynamic programming, we propose an efficient sampling algorithm to derive the approximate $(p, q)$ -biclique count from the $(p, q)$ -broom counts. Theoretically, our method offers unbiased estimates with provable error guarantees. Empirically, our solution outperforms existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Management and Algorithms · Algorithms and Data Compression · Bayesian Modeling and Causal Inference