Certifiable Robustness for Nearest Neighbor Classifiers

Austen Z. Fan; Paraschos Koutris

arXiv:2201.04770·cs.LG·January 19, 2022

Certifiable Robustness for Nearest Neighbor Classifiers

Austen Z. Fan, Paraschos Koutris

PDF

TL;DR

This paper investigates the computational complexity of certifying the robustness of k-Nearest Neighbors classifiers in the presence of inconsistent datasets with functional dependencies, revealing a dichotomy in problem complexity.

Contribution

It establishes a complexity dichotomy for certifying robustness of k-NN classifiers under data inconsistencies with functional dependencies, including polynomial-time and coNP-hard cases.

Findings

01

Complexity dichotomy for robustness certification problems.

02

Polynomial-time algorithms exist for certain FDs.

03

Certifying robustness is coNP-hard in some cases.

Abstract

ML models are typically trained using large datasets of high quality. However, training datasets often contain inconsistent or incomplete data. To tackle this issue, one solution is to develop algorithms that can check whether a prediction of a model is certifiably robust. Given a learning algorithm that produces a classifier and given an example at test time, a classification outcome is certifiably robust if it is predicted by every model trained across all possible worlds (repairs) of the uncertain (inconsistent) dataset. This notion of robustness falls naturally under the framework of certain answers. In this paper, we study the complexity of certifying robustness for a simple but widely deployed classification algorithm, $k$ -Nearest Neighbors ( $k$ -NN). Our main focus is on inconsistent datasets when the integrity constraints are functional dependencies (FDs). For this setting, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRepair