User-Centered Security in Natural Language Processing
Chris Emmery

TL;DR
This paper introduces a user-centered security framework for NLP, focusing on privacy and content moderation challenges, and explores adversarial attacks to enhance security and robustness in these areas.
Contribution
It proposes a novel framework for user-centered security in NLP and investigates adversarial attacks to improve privacy and classifier robustness.
Findings
Adversarial attacks can obfuscate author profiling models.
Adversarial samples help evaluate cyberbullying detection robustness.
Framework enhances understanding of security vulnerabilities in NLP.
Abstract
This dissertation proposes a framework of user-centered security in Natural Language Processing (NLP), and demonstrates how it can improve the accessibility of related research. Accordingly, it focuses on two security domains within NLP with great public interest. First, that of author profiling, which can be employed to compromise online privacy through invasive inferences. Without access and detailed insight into these models' predictions, there is no reasonable heuristic by which Internet users might defend themselves from such inferences. Secondly, that of cyberbullying detection, which by default presupposes a centralized implementation; i.e., content moderation across social platforms. As access to appropriate data is restricted, and the nature of the task rapidly evolves (both through lexical variation, and cultural shifts), the effectiveness of its classifiers is greatly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting · Cybercrime and Law Enforcement Studies
