Automated Detection of Doxing on Twitter

Younes Karimi; Anna Squicciarini; Shomir Wilson

arXiv:2202.00879·cs.SI·November 15, 2022

Automated Detection of Doxing on Twitter

Younes Karimi, Anna Squicciarini, Shomir Wilson

PDF

Open Access

TL;DR

This paper develops and evaluates machine learning methods to automatically detect doxing on Twitter, achieving high accuracy and recall, to address the challenge of identifying sensitive personal information disclosures online.

Contribution

It introduces and compares nine detection approaches, including string-matching and embedding techniques, specifically tailored for identifying doxing on Twitter.

Findings

01

Achieved 96.86% accuracy in detection

02

Achieved 97.37% recall in detection

03

Identified effective use of contextualized string embeddings

Abstract

Doxing refers to the practice of disclosing sensitive personal information about a person without their consent. This form of cyberbullying is an unpleasant and sometimes dangerous phenomenon for online social networks. Although prior work exists on automated identification of other types of cyberbullying, a need exists for methods capable of detecting doxing on Twitter specifically. We propose and evaluate a set of approaches for automatically detecting second- and third-party disclosures on Twitter of sensitive private information, a subset of which constitutes doxing. We summarize our findings of common intentions behind doxing episodes and compare nine different approaches for automated detection based on string-matching and one-hot encoded heuristics, as well as word and contextualized string embedding representations of tweets. We identify an approach providing 96.86% accuracy and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting · Stalking, Cyberstalking, and Harassment