Provable Differentially Private Computation of the Cross-Attention Mechanism
Yekun Ke, Yingyu Liang, Zhenmei Shi, Zhao Song, Jiahao Zhang

TL;DR
This paper introduces a novel differentially private data structure for cross-attention mechanisms in AI models, providing provable privacy guarantees while maintaining efficiency and robustness against adaptive queries.
Contribution
It presents the first provably differentially private cross-attention mechanism with theoretical guarantees and practical efficiency for large generative models.
Findings
Achieves $ ilde{O}(ndr^2)$ space and initialization complexity.
Satisfies $( ext{epsilon}, ext{delta})$-DP with bounded error.
Maintains robustness against adaptive, adversarial queries.
Abstract
Cross-attention has emerged as a cornerstone module in modern artificial intelligence, underpinning critical applications such as retrieval-augmented generation (RAG), system prompting, and guided stable diffusion. However, this is a rising concern about securing the privacy of cross-attention, as the underlying key and value matrices frequently encode sensitive data or private user information. In this work, we introduce a novel data structure designed to enforce differential privacy (DP) for cross-attention mechanisms, accompanied by provable theoretical guarantees. Specifically, letting denote the input sequence length, the feature dimension, the maximum magnitude of query and key matrices, the maximum magnitude of the value matrix, and the parameters for polynomial kernel methods, our proposed structure achieves space and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Linear Warmup With Linear Decay · Multi-Head Attention · Weight Decay · Residual Connection · Dropout · WordPiece · Attention Dropout · Adam
