Detecting Network-based Internet Censorship via Latent Feature Representation Learning
Shawn P. Duncan, Hui Chen

TL;DR
This paper introduces machine learning models, including autoencoders and image-based CNNs, to detect network-based Internet censorship more effectively than rule-based methods, capturing nuanced censorship instances.
Contribution
It proposes novel latent feature and image-based classification models that automatically detect censorship without relying on manually crafted rules.
Findings
Models detect censorship instances missed by fingerprint-based methods.
Latent feature approach uncovers more diverse censorship cases.
Both models outperform traditional rule-based detection.
Abstract
Internet censorship is a phenomenon of societal importance and attracts investigation from multiple disciplines. Several research groups, such as Censored Planet, have deployed large scale Internet measurement platforms to collect network reachability data. However, existing studies generally rely on manually designed rules (i.e., using censorship fingerprints) to detect network-based Internet censorship from the data. While this rule-based approach yields a high true positive detection rate, it suffers from several challenges: it requires human expertise, is laborious, and cannot detect any censorship not captured by the rules. Seeking to overcome these challenges, we design and evaluate a classification model based on latent feature representation learning and an image-based classification model to detect network-based Internet censorship. To infer latent feature representations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Hate Speech and Cyberbullying Detection · Cybercrime and Law Enforcement Studies
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Tanh Activation · Concatenated Skip Connection · Batch Normalization · Sigmoid Activation · Six Ways To Communicate To Someone At Expedia Via Phone And Email's. · Dropout · Kaiming Initialization · Softmax · Dense Connections
