Certifiable Robustness for Nearest Neighbor Classifiers
Austen Z. Fan, Paraschos Koutris

TL;DR
This paper investigates the computational complexity of certifying the robustness of k-Nearest Neighbors classifiers in the presence of inconsistent datasets with functional dependencies, revealing a dichotomy in problem complexity.
Contribution
It establishes a complexity dichotomy for certifying robustness of k-NN classifiers under data inconsistencies with functional dependencies, including polynomial-time and coNP-hard cases.
Findings
Complexity dichotomy for robustness certification problems.
Polynomial-time algorithms exist for certain FDs.
Certifying robustness is coNP-hard in some cases.
Abstract
ML models are typically trained using large datasets of high quality. However, training datasets often contain inconsistent or incomplete data. To tackle this issue, one solution is to develop algorithms that can check whether a prediction of a model is certifiably robust. Given a learning algorithm that produces a classifier and given an example at test time, a classification outcome is certifiably robust if it is predicted by every model trained across all possible worlds (repairs) of the uncertain (inconsistent) dataset. This notion of robustness falls naturally under the framework of certain answers. In this paper, we study the complexity of certifying robustness for a simple but widely deployed classification algorithm, -Nearest Neighbors (-NN). Our main focus is on inconsistent datasets when the integrity constraints are functional dependencies (FDs). For this setting, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRepair
