Puda: Private User Dataset Agent for User-Sovereign and Privacy-Preserving Personalized AI
Akinori Maeda, Yuto Sekiya, Sota Sugimura, Tomoya Asai, Yu Tsuda, Kohei Ikeda, Hiroshi Fujii, Kohei Watanabe

TL;DR
Puda is a browser-based system that empowers users to control personal data sharing at multiple privacy levels, enabling effective personalized AI services while preserving user sovereignty and privacy.
Contribution
It introduces a user-sovereign architecture for data aggregation and client-side management, balancing personalization and privacy in AI services.
Findings
Predefined Category Subsets achieve 97.2% of full data personalization performance.
Puda enables multi-granularity data management for privacy-preserving personalization.
Effective privacy-personalization trade-off mitigation demonstrated in travel planning task.
Abstract
Personal data centralization among dominant platform providers including search engines, social networking services, and e-commerce has created siloed ecosystems that restrict user sovereignty, thereby impeding data use across services. Meanwhile, the rapid proliferation of Large Language Model (LLM)-based agents has intensified demand for highly personalized services that require the dynamic provision of diverse personal data. This presents a significant challenge: balancing the utilization of such data with privacy protection. To address this challenge, we propose Puda (Private User Dataset Agent), a user-sovereign architecture that aggregates data across services and enables client-side management. Puda allows users to control data sharing at three privacy levels: (i) Detailed Browsing History, (ii) Extracted Keywords, and (iii) Predefined Category Subsets. We implemented Puda as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Recommender Systems and Techniques
