Urania: Differentially Private Insights into AI Use

Daogao Liu; Edith Cohen; Badih Ghazi; Peter Kairouz; Pritish Kamath; Alexander Knop; Ravi Kumar; Pasin Manurangsi; Adam Sealfon; Da Yu; Chiyuan Zhang

arXiv:2506.04681·cs.LG·September 25, 2025

Urania: Differentially Private Insights into AI Use

Daogao Liu, Edith Cohen, Badih Ghazi, Peter Kairouz, Pritish Kamath, Alexander Knop, Ravi Kumar, Pasin Manurangsi, Adam Sealfon, Da Yu, Chiyuan Zhang

PDF

TL;DR

Urania is a new framework that provides privacy-preserving insights into AI chatbot interactions using differential privacy, innovative keyword extraction, and rigorous evaluation to balance data utility with user privacy.

Contribution

Urania introduces a comprehensive DP framework with novel keyword extraction and privacy evaluation methods for analyzing LLM interactions.

Findings

01

Effective preservation of lexical and semantic content.

02

Comparable insights to non-private pipelines.

03

Enhanced robustness in privacy guarantees.

Abstract

We introduce $U r ania$ , a novel framework for generating insights about LLM chatbot interactions with rigorous differential privacy (DP) guarantees. The framework employs a private clustering mechanism and innovative keyword extraction methods, including frequency-based, TF-IDF-based, and LLM-guided approaches. By leveraging DP tools such as clustering, partition selection, and histogram-based summarization, $U r ania$ provides end-to-end privacy protection. Our evaluation assesses lexical and semantic content preservation, pair similarity, and LLM-based metrics, benchmarking against a non-private Clio-inspired pipeline (Tamkin et al., 2024). Moreover, we develop a simple empirical privacy evaluation that demonstrates the enhanced robustness of our DP pipeline. The results show the framework's ability to extract meaningful conversational insights while maintaining stringent user privacy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.