Almost Optimal Bounds for Sublinear-Time Sampling of $k$-Cliques: Sampling Cliques is Harder Than Counting
Talya Eden, Dana Ron, Will Rosenbaum

TL;DR
This paper establishes nearly tight bounds for sublinear-time algorithms that sample $k$-cliques uniformly in graphs, revealing a fundamental complexity separation between counting and sampling tasks.
Contribution
It provides the first tight bounds for sublinear-time $k$-clique sampling, demonstrating a separation from approximate counting in the same regime.
Findings
Lower bounds are derived using graph constructions with hidden cliques.
Upper bounds are achieved via auxiliary graph constructions and existing edge-sampling algorithms.
The results show sampling is strictly harder than counting in the sublinear regime.
Abstract
In this work, we consider the problem of sampling a -clique in a graph from an almost uniform distribution in sublinear time in the general graph query model. Specifically the algorithm should output each -clique with probability , where denotes the number of -cliques in the graph and is a given approximation parameter. We prove that the query complexity of this problem is \[ \Theta^*\left(\max\left\{ \left(\frac{(n\alpha)^{k/2}}{ n_k}\right)^{\frac{1}{k-1}} ,\; \min\left\{n\alpha,\frac{n\alpha^{k-1}}{n_k} \right\}\right\}\right). \] where is the number of vertices in the graph, is its arboricity, and suppresses the dependence on . Interestingly, this establishes a separation between approximate counting and approximate uniform sampling in the sublinear regime. For example, if ,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
