Towards Ethical Content-Based Detection of Online Influence Campaigns

Evan Crothers; Nathalie Japkowicz; Herna Viktor

arXiv:1908.11030·cs.CY·December 11, 2019

Towards Ethical Content-Based Detection of Online Influence Campaigns

Evan Crothers, Nathalie Japkowicz, Herna Viktor

PDF

1 Repo

TL;DR

This paper explores a text-based method for detecting online influence campaigns, introducing named entity masking to improve robustness and discussing ethical implications of language biases.

Contribution

It proposes named entity masking to reduce false positives in influence detection and highlights ethical challenges related to language biases.

Findings

01

Named entity masking reduces false positives when key entities are mentioned.

02

Both masked and unmasked models show increased false positives with Russian native speakers.

03

Ethical considerations regarding language bias are identified for future research.

Abstract

The detection of clandestine efforts to influence users in online communities is a challenging problem with significant active development. We demonstrate that features derived from the text of user comments are useful for identifying suspect activity, but lead to increased erroneous identifications when keywords over-represented in past influence campaigns are present. Drawing on research in native language identification (NLI), we use "named entity masking" (NEM) to create sentence features robust to this shortcoming, while maintaining comparable classification accuracy. We demonstrate that while NEM consistently reduces false positives when key named entities are mentioned, both masked and unmasked models exhibit increased false positive rates on English sentences by Russian native speakers, raising ethical considerations that should be addressed in future research.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ecrows/l2-reddit-experiment
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.