Verifiable evaluations of machine learning models using zkSNARKs
Tobin South, Alexander Camuto, Shrey Jain, Shayla Nguyen, Robert, Mahari, Christian Paquin, Jason Morton, Alex 'Sandy' Pentland

TL;DR
This paper introduces a method using zkSNARKs to create verifiable proofs of machine learning model evaluations, enabling users to confirm model performance and fairness without re-running benchmarks.
Contribution
It presents a novel approach for verifiable model evaluation using zero-knowledge proofs, applicable to standard neural networks with varying computational costs.
Findings
Successfully demonstrated verifiable evaluations on real-world models
Identified key challenges and solutions in implementing zkSNARKs for ML
Provides a new transparency paradigm for private model assessments
Abstract
In a world of increasing closed-source commercial machine learning models, model evaluations from developers must be taken at face value. These benchmark results-whether over task accuracy, bias evaluations, or safety checks-are traditionally impossible to verify by a model end-user without the costly or impossible process of re-performing the benchmark on black-box model outputs. This work presents a method of verifiable model evaluation using model inference through zkSNARKs. The resulting zero-knowledge computational proofs of model outputs over datasets can be packaged into verifiable evaluation attestations showing that models with fixed private weights achieve stated performance or fairness metrics over public inputs. We present a flexible proving system that enables verifiable attestations to be performed on any standard neural network model with varying compute requirements. For…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
