Synopsis: Secure and private trend inference from encrypted semantic embeddings

Madelyne Xiao; Palak Jain; Micha Gorelick; Sarah Scheffler

arXiv:2505.23880·cs.CR·June 2, 2025

Synopsis: Secure and private trend inference from encrypted semantic embeddings

Madelyne Xiao, Palak Jain, Micha Gorelick, Sarah Scheffler

PDF

Open Access 1 Repo

TL;DR

Synopsis presents a secure system enabling analysis of encrypted messaging data for trend inference, balancing privacy with investigative utility through innovative cryptographic and differential privacy techniques.

Contribution

It introduces a novel architecture combining differential privacy and secure multi-party computation for analyzing encrypted message embeddings in messaging platforms.

Findings

01

Queries execute in about 30 seconds

02

Achieves over 94% accuracy on benchmark tasks

03

Demonstrates efficiency and privacy-preserving analysis on WhatsApp data

Abstract

WhatsApp and many other commonly used communication platforms guarantee end-to-end encryption (E2EE), which requires that service providers lack the cryptographic keys to read communications on their own platforms. WhatsApp's privacy-preserving design makes it difficult to study important phenomena like the spread of misinformation or political messaging, as users have a clear expectation and desire for privacy and little incentive to forfeit that privacy in the process of handing over raw data to researchers, journalists, or other parties. We introduce Synopsis, a secure architecture for analyzing messaging trends in consensually-donated E2EE messages using message embeddings. Since the goal of this system is investigative journalism workflows, Synopsis must facilitate both exploratory and targeted analyses -- a challenge for systems using differential privacy (DP), and, for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

digital-witness-lab/synopsis
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling