Scalable Approximate Biclique Counting over Large Bipartite Graphs
Jingbang Chen, Weinuo Li, Yingli Zhou, Hangrui Zhou, Qiuyang Mang, Can Wang, Yixiang Fang, and Chenhao Ma

TL;DR
This paper introduces a scalable approximate method for counting $(p,q)$-bicliques in large bipartite graphs, using novel graph structures and sampling techniques to achieve high accuracy and efficiency.
Contribution
The authors propose a new $(p,q)$-broom structure and sampling algorithm that provide unbiased estimates with error guarantees for approximate biclique counting.
Findings
Outperforms existing methods in accuracy, reducing error by up to 8 times.
Achieves significant runtime speedup, up to 50 times faster.
Effective on nine real-world bipartite networks, demonstrating scalability.
Abstract
Counting -bicliques in bipartite graphs is crucial for a variety of applications, from recommendation systems to cohesive subgraph analysis. Yet, it remains computationally challenging due to the combinatorial explosion to exactly count the -bicliques. In many scenarios, e.g., graph kernel methods, however, exact counts are not strictly required. To design a scalable and high-quality approximate solution, we novelly resort to -broom, a special spanning tree of the -biclique, which can be counted via graph coloring and efficient dynamic programming. Based on the intermediate results of the dynamic programming, we propose an efficient sampling algorithm to derive the approximate -biclique count from the -broom counts. Theoretically, our method offers unbiased estimates with provable error guarantees. Empirically, our solution outperforms existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Algorithms and Data Compression · Bayesian Modeling and Causal Inference
