Catch Me If You Can: Deceiving Stance Detection and Geotagging Models to   Protect Privacy of Individuals on Twitter

Dilara Dogan; Bahadir Altun; Muhammed Said Zengin; Mucahid Kutlu and; Tamer Elsayed

arXiv:2207.11500·cs.CL·July 26, 2022

Catch Me If You Can: Deceiving Stance Detection and Geotagging Models to Protect Privacy of Individuals on Twitter

Dilara Dogan, Bahadir Altun, Muhammed Said Zengin, Mucahid Kutlu and, Tamer Elsayed

PDF

Open Access

TL;DR

This paper explores simple text modification techniques to evade NLP models like stance detection and geotagging on Twitter, revealing effective methods to protect user privacy against automated detection.

Contribution

It demonstrates that inserting typos can significantly reduce stance detection accuracy, and interacting with different users can hinder geotagging models, offering practical privacy-preserving strategies.

Findings

01

Typos decrease BERT-based stance detection accuracy.

02

Paraphrasing does not significantly affect stance detection.

03

User interaction reduces geotagging model performance by nearly 50%.

Abstract

The recent advances in natural language processing have yielded many exciting developments in text analysis and language understanding models; however, these models can also be used to track people, bringing severe privacy concerns. In this work, we investigate what individuals can do to avoid being detected by those models while using social media platforms. We ground our investigation in two exposure-risky tasks, stance detection and geotagging. We explore a variety of simple techniques for modifying text, such as inserting typos in salient words, paraphrasing, and adding dummy social media posts. Our experiments show that the performance of BERT-based models fined tuned for stance detection decreases significantly due to typos, but it is not affected by paraphrasing. Moreover, we find that typos have minimal impact on state-of-the-art geotagging models due to their increased reliance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Privacy, Security, and Data Protection · Freedom of Expression and Defamation