Federated Calibration and Evaluation of Binary Classifiers
Graham Cormode, Igor Markov

TL;DR
This paper develops methods for calibrating and evaluating binary classifiers in federated settings, ensuring privacy while maintaining accuracy across different privacy models.
Contribution
It introduces novel techniques for federated calibration and evaluation of classifiers under various privacy constraints, addressing key practical challenges.
Findings
Calibration and evaluation are feasible under secure aggregation, distributed DP, and local DP.
Tradeoffs between privacy, accuracy, and data efficiency are characterized.
Guidelines for data sufficiency in federated calibration are provided.
Abstract
We address two major obstacles to practical use of supervised classifiers on distributed private data. Whether a classifier was trained by a federation of cooperating clients or trained centrally out of distribution, (1) the output scores must be calibrated, and (2) performance metrics must be evaluated -- all without assembling labels in one place. In particular, we show how to perform calibration and compute precision, recall, accuracy and ROC-AUC in the federated setting under three privacy models (i) secure aggregation, (ii) distributed differential privacy, (iii) local differential privacy. Our theorems and experiments clarify tradeoffs between privacy, accuracy, and data efficiency. They also help decide whether a given application has sufficient data to support federated calibration and evaluation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Blockchain Technology Applications and Security
