Toward provably private analytics and insights into GenAI use
Albert Cheu, Artem Lagzdin, Brett McLarnon, Daniel Ramage, Katharine Daly, Marco Gruteser, Peter Kairouz, Rakshita Tandon, Stanislav Chiknavaryan, Timon Van Overveldt, and Zoe Gong

TL;DR
This paper introduces a federated analytics system leveraging Trusted Execution Environments to ensure verifiable privacy and security guarantees for processing data in large-scale, real-world GenAI applications.
Contribution
It presents a novel TEE-based federated analytics architecture with verifiable privacy guarantees, supporting flexible workloads including unstructured data and differential privacy.
Findings
Successfully deployed in production for real-world GenAI insights
Provides verifiable privacy guarantees through TEE technology
Supports processing unstructured data with LLMs and differential privacy
Abstract
Large-scale systems that compute analytics over a fleet of devices must achieve high privacy and security standards while also meeting data quality, usability, and resource efficiency expectations. We present a next-generation federated analytics system that uses Trusted Execution Environments (TEEs) based on technologies like AMD SEV-SNP and Intel TDX to provide verifiable privacy guarantees for all server-side processing. In our system, devices encrypt and upload data, tagging it with a limited set of allowable server-side processing steps. An open source, TEE-hosted key management service guarantees that the data is accessible only to those steps, which are themselves protected by TEE confidentiality and integrity assurance guarantees. The system is designed for flexible workloads, including processing unstructured data with LLMs (for structured summarization) before aggregation into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
