Augmenting Rule-based DNS Censorship Detection at Scale with Machine   Learning

Jacob Brown; Xi Jiang; Van Tran; Arjun Nitin Bhagoji; Nguyen Phong; Hoang; Nick Feamster; Prateek Mittal; Vinod Yegneswaran

arXiv:2302.02031·cs.LG·June 19, 2023·1 cites

Augmenting Rule-based DNS Censorship Detection at Scale with Machine Learning

Jacob Brown, Xi Jiang, Van Tran, Arjun Nitin Bhagoji, Nguyen Phong, Hoang, Nick Feamster, Prateek Mittal, Vinod Yegneswaran

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates how machine learning models can enhance DNS censorship detection by automating processes, uncovering new censorship signatures, and improving detection reliability over traditional heuristic methods.

Contribution

It introduces supervised and unsupervised ML approaches for DNS censorship detection, revealing new censorship instances and signatures missed by existing heuristics.

Findings

01

Supervised models learn existing detection heuristics effectively.

02

Unsupervised models identify new censorship instances without prior labels.

03

Both methods uncover additional DNS blocking signatures overlooked by heuristics.

Abstract

The proliferation of global censorship has led to the development of a plethora of measurement platforms to monitor and expose it. Censorship of the domain name system (DNS) is a key mechanism used across different countries. It is currently detected by applying heuristics to samples of DNS queries and responses (probes) for specific destinations. These heuristics, however, are both platform-specific and have been found to be brittle when censors change their blocking behavior, necessitating a more reliable automated process for detecting censorship. In this paper, we explore how machine learning (ML) models can (1) help streamline the detection process, (2) improve the potential of using large-scale datasets for censorship detection, and (3) discover new censorship instances and blocking signatures missed by existing heuristic methods. Our study shows that supervised models, trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

noise-lab/automated-dns-censorship
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInternet Traffic Analysis and Secure E-voting · Network Security and Intrusion Detection · Hate Speech and Cyberbullying Detection