Automated Detection of Doxing on Twitter
Younes Karimi, Anna Squicciarini, Shomir Wilson

TL;DR
This paper develops and evaluates machine learning methods to automatically detect doxing on Twitter, achieving high accuracy and recall, to address the challenge of identifying sensitive personal information disclosures online.
Contribution
It introduces and compares nine detection approaches, including string-matching and embedding techniques, specifically tailored for identifying doxing on Twitter.
Findings
Achieved 96.86% accuracy in detection
Achieved 97.37% recall in detection
Identified effective use of contextualized string embeddings
Abstract
Doxing refers to the practice of disclosing sensitive personal information about a person without their consent. This form of cyberbullying is an unpleasant and sometimes dangerous phenomenon for online social networks. Although prior work exists on automated identification of other types of cyberbullying, a need exists for methods capable of detecting doxing on Twitter specifically. We propose and evaluate a set of approaches for automatically detecting second- and third-party disclosures on Twitter of sensitive private information, a subset of which constitutes doxing. We summarize our findings of common intentions behind doxing episodes and compare nine different approaches for automated detection based on string-matching and one-hot encoded heuristics, as well as word and contextualized string embedding representations of tweets. We identify an approach providing 96.86% accuracy and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting · Stalking, Cyberstalking, and Harassment
