Asymptotically Optimal Bounds for Estimating H-Index in Sublinear Time with Applications to Subgraph Counting
Sepehr Assadi, Hoai-An Nguyen

TL;DR
This paper introduces a sublinear time algorithm for estimating the h-index with optimal bounds and applies the technique to subgraph counting, providing new theoretical insights and lower bounds.
Contribution
It presents the first asymptotically optimal bounds for h-index estimation in sublinear time and introduces a novel lower bound technique applicable to subgraph counting.
Findings
Algorithm achieves (1±ε)-approximation with high probability in optimal time.
Lower bound matches the algorithm's performance, proving optimality.
Technique extends to subgraph counting, improving previous bounds.
Abstract
The -index is a metric used to measure the impact of a user in a publication setting, such as a member of a social network with many highly liked posts or a researcher in an academic domain with many highly cited publications. Specifically, the -index of a user is the largest integer such that at least publications of the user have at least units of positive feedback. We design an algorithm that, given query access to the publications of a user and each publication's corresponding positive feedback number, outputs a -approximation of the -index of this user with probability at least in time \[ O(\frac{n \cdot \ln{(1/\delta)}}{\varepsilon^2 \cdot h}), \] where is the actual -index which is unknown to the algorithm a-priori. We then design a novel lower bound technique that allows us to prove that this bound is in fact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Optimization and Search Problems · Machine Learning and Algorithms
