Red is Sus: Automated Identification of Low-Quality Service Availability   Claims in the US National Broadband Map

Syed Tauhidun Nabi; Zhuowei Wen; Brooke Ritter; Shaddi Hasan

arXiv:2410.08518·cs.NI·October 18, 2024

Red is Sus: Automated Identification of Low-Quality Service Availability Claims in the US National Broadband Map

Syed Tauhidun Nabi, Zhuowei Wen, Brooke Ritter, Shaddi Hasan

PDF

1 Repo

TL;DR

This paper presents a machine learning approach to automatically detect low-quality or potentially false broadband availability claims in the US National Broadband Map, improving data reliability for policymakers and the public.

Contribution

It introduces a novel dataset of broadband observations and a high-accuracy classification model to assess the integrity of service provider claims.

Findings

01

Achieved AUCs over 0.98 in classifying claim accuracy

02

Developed a dataset with 750k observations from 900+ ISPs

03

Provided a tool for identifying unreliable broadband data

Abstract

The FCC's National Broadband Map aspires to provide an unprecedented view into broadband availability in the US. However, this map, which also determines eligibility for public grant funding, relies on self-reported data from service providers that in turn have incentives to strategically misrepresent their coverage. In this paper, we develop an approach for automatically identifying these low-quality service claims in the National Broadband Map. To do this, we develop a novel dataset of broadband availability consisting of 750k observations from more than 900 US ISPs, derived from a combination of regulatory data and crowdsourced speed tests. Using this dataset, we develop a model to classify the accuracy of service provider regulatory filings and achieve AUCs over 0.98 for unseen examples. Our approach provides an effective technique to enable policymakers, civil society, and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

spin-vt/red_is_sus
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.