Quick Detection of High-degree Entities in Large Directed Networks
Konstantin Avrachenkov, Nelly Litvak, Liudmila Ostroumova, Prokhorenkova, Eugenia Suyargulova

TL;DR
This paper introduces a fast, two-stage randomized algorithm for identifying top high-degree entities in large directed networks, achieving high accuracy with minimal API requests and leveraging Extreme Value Theory for analysis.
Contribution
The work presents a novel, efficient algorithm for detecting top entities in large networks and provides a theoretical analysis using Extreme Value Theory to explain its effectiveness.
Findings
Requires only 1,000 API requests to find top-100 users in Twitter
Achieves over 90% precision in identifying top entities
Number of requests is sublinear in total number of entities
Abstract
In this paper, we address the problem of quick detection of high-degree entities in large online social networks. Practical importance of this problem is attested by a large number of companies that continuously collect and update statistics about popular entities, usually using the degree of an entity as an approximation of its popularity. We suggest a simple, efficient, and easy to implement two-stage randomized algorithm that provides highly accurate solutions for this problem. For instance, our algorithm needs only one thousand API requests in order to find the top-100 most followed users in Twitter, a network with approximately a billion of registered users, with more than 90% precision. Our algorithm significantly outperforms existing methods and serves many different purposes, such as finding the most popular users or the most popular interest groups in social networks. An…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Bayesian Methods and Mixture Models · Stochastic processes and statistical mechanics
