Sell Data to AI Algorithms Without Revealing It: Secure Data Valuation and Sharing via Homomorphic Encryption
Michael Yang, Ruijiang Gao, Zhiqiang Zheng

TL;DR
This paper introduces a privacy-preserving protocol that allows data owners to securely quantify and share data value with AI developers using homomorphic encryption, without revealing the raw data.
Contribution
It presents the Trustworthy Influence Protocol (TIP), integrating homomorphic encryption with influence functions to enable secure, scalable data valuation for AI models.
Findings
Encrypted valuation signals correlate highly with clinical utility
The approach maintains near-plaintext accuracy with low-rank gradient projections
Data value distribution is heavy-tailed, impacting compensation models
Abstract
The rapid expansion of Artificial Intelligence is hindered by a fundamental friction in data markets: the value-privacy dilemma, where buyers cannot verify a dataset's utility without inspection, yet inspection may expose the data (Arrow's Information Paradox). We resolve this challenge by introducing the Trustworthy Influence Protocol (TIP), a privacy-preserving framework that enables prospective buyers to quantify the utility of external data without ever decrypting the raw assets. By integrating Homomorphic Encryption with gradient-based influence functions, our approach allows for the precise, blinded scoring of data points against a buyer's specific AI model. To ensure scalability for Large Language Models (LLMs), we employ low-rank gradient projections that reduce computational overhead while maintaining near-perfect fidelity to plaintext baselines, as demonstrated across BERT and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Blockchain Technology Applications and Security
