Privacy-Preserving Mechanisms Enable Cheap Verifiable Inference of LLMs
Arka Pal, Louai Zahran, William Gvozdjak, Akilesh Potti, Micah Goldblum

TL;DR
This paper introduces new, cost-effective privacy-preserving protocols that enable verifiable large language model inference, ensuring trustworthiness without significant computational overhead.
Contribution
The authors develop two novel protocols that leverage privacy-preserving inference to achieve cheap, verifiable LLM inference with minimal additional computation.
Findings
Protocols are computationally inexpensive to implement
Verification runtime is faster than existing cryptographic methods
Guarantees over inference results are effectively provided
Abstract
As large language models (LLMs) continue to grow in size, fewer users are able to host and run models locally. This has led to increased use of third-party hosting services. However, in this setting, there is a lack of guarantees on the computation performed by the inference provider. For example, a dishonest provider may replace an expensive large model with a cheaper-to-run weaker model and return the results from the weaker model to the user. Existing tools to verify inference typically rely on methods from cryptography such as zero-knowledge proofs (ZKPs), but these add significant computational overhead, and remain infeasible for use for large models. In this work, we develop a new insight -- that given a method for performing private LLM inference, one can obtain forms of verified inference at marginal extra cost. Specifically, we propose two new protocols which leverage…
Peer Reviews
Decision·Submitted to ICLR 2026
The submission explores alternative solutions to verifiable LLM inference. Since zero-knowledge proofs for verifiable LLM are prohibitively expensive due to the underlying cryptographic operations, proposing a new paradigm is a promising direction.
Unfortunately, the submission does not adequately justify the security of the proposed methods. - Protocol 3 does not provide a meaningful guarantee. Since the user prompt always follows the same format, in the FHE setting, any malicious inference provider can identify the location of the key in the text. By applying suitable homomorphic operations, an adversary can single out the ciphertext containing the key and append it to an arbitrary output text. Thus, this methodology barely meets the re
1. This work presents a creative and timely exploration of a relatively unstudied relationship between privacy and verifiability in large model inference. 2. The motivation is clear, as the growing reliance on third-party model hosting introduces both privacy and integrity risks. 3. The three proposed protocols provide a spectrum of practical trade-offs between interaction cost, computational efficiency, and verification strength.
1. The paper seems to rely mostly on empirical results rather than formal proofs, and it is not clear how strong the guarantees are compared with existing cryptographic approaches. 2. The evaluation, while comprehensive in experiments, could include more discussion of practical deployment aspects such as latency, communication overhead, and integration into existing systems. It is also not entirely clear how well the proposed methods scale to long or interactive LLM sessions. 3. For readers wi
1. This paper addresses a high-priority problem in LLM deployment with a novel, practical insight: leveraging private inference to enable low-overhead verification. 2. The link between private and verified inference is a key innovation.
1. The presentation of this paper is not good enough, and it is a little hard to understand how it works. Maybe a schematic graph helps 2. What are connections between three protocols? How does this paper relate to existing works?
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Cryptography and Data Security
