Analyzing Disparity and Temporal Progression of Internet Quality through Crowdsourced Measurements with Bias-Correction
Hyeongseong Lee, Udit Paul, Arpit Gupta, Elizabeth Belding and, Mengyang Gu

TL;DR
This study analyzes crowdsourced internet speed data, identifies regional sampling biases linked to demographics, and proposes correction methods to improve the accuracy of internet performance assessments over time.
Contribution
The paper introduces new bias-correction methods for crowdsourced internet measurements and explores their relationship with demographic factors.
Findings
Sampling bias varies across regions and correlates with socioeconomic variables.
Bias correction reduces discrepancies in city-wide internet speed estimates.
Internet speeds have increased over time, with bias correction slightly adjusting the magnitude of this trend.
Abstract
Crowdsourced speedtest measurements are an important tool for studying internet performance from the end user perspective. Nevertheless, despite the accuracy of individual measurements, simplistic aggregation of these data points is problematic due to their intrinsic sampling bias. In this work, we utilize a dataset of nearly 1 million individual Ookla Speedtest measurements, correlate each datapoint with 2019 Census demographic data, and develop new methods to present a novel analysis to quantify regional sampling bias and the relationship of internet performance to demographic profile. We find that the crowdsourced Ookla Speedtest data points contain significant sampling bias across different census block groups based on a statistical test of homogeneity. We introduce two methods to correct the regional bias by the population of each census block group. Whereas the sampling bias leads…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Social Media and Politics · Image and Video Quality Assessment
