Private Text Classification
Leif W. Hanlen, Richard Nock, Hanna Suominen, Neil Bacon

TL;DR
This paper investigates privacy-preserving methods for text classification, employing techniques like homomorphic encryption and secure computation to enable analysis on confidential text data without compromising privacy.
Contribution
It introduces a framework combining various privacy-preserving techniques for text analytics and develops preliminary methods for binary classifiers on private text corpora.
Findings
Demonstrated the feasibility of privacy-preserving text classification.
Developed initial approaches for binary classifiers using Rademacher operators.
Showed how to adapt text processing to privacy-preserving techniques.
Abstract
Confidential text corpora exist in many forms, but do not allow arbitrary sharing. We explore how to use such private corpora using privacy preserving text analytics. We construct typical text processing applications using appropriate privacy preservation techniques (including homomorphic encryption, Rademacher operators and secure computation). We set out the preliminary materials from Rademacher operators for binary classifiers, and then construct basic text processing approaches to match those binary classifiers.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling
