The Capacity of Private Information Retrieval from Coded Databases
Karim Banawan, Sennur Ulukus

TL;DR
This paper derives the maximum efficiency of private information retrieval from coded distributed databases, revealing a fundamental tradeoff between retrieval cost and storage cost, generalizing previous results for replicated databases.
Contribution
It establishes the information-theoretic capacity of PIR from coded databases, linking capacity to code rate and number of messages, extending classical PIR results.
Findings
Capacity formula: C=(1+R_c+R_c^2+...+R_c^{M-1})^{-1}
Capacity depends only on code rate and number of messages
Generalizes PIR capacity results from replication to coding
Abstract
We consider the problem of private information retrieval (PIR) over a distributed storage system. The storage system consists of non-colluding databases, each storing a coded version of messages. In the PIR problem, the user wishes to retrieve one of the available messages without revealing the message identity to any individual database. We derive the information-theoretic capacity of this problem, which is defined as the maximum number of bits of the desired message that can be privately retrieved per one bit of downloaded information. We show that the PIR capacity in this case is , where is the rate of the code used. The capacity is a function of the code rate and the number of messages only regardless of the explicit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
