Improving classifier decision boundaries using nearest neighbors

Johannes Schneider

arXiv:2310.03927·cs.LG·October 9, 2023

Improving classifier decision boundaries using nearest neighbors

Johannes Schneider

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a simple nearest neighbors-based method to improve neural network decision boundaries, enhancing robustness, accuracy, and interpretability without altering the model architecture or training process.

Contribution

The authors propose a straightforward algorithm that averages predictions with nearest neighbors in latent space, improving multiple neural network performance metrics.

Findings

01

Enhanced resistance to label noise

02

Improved robustness against adversarial attacks

03

Increased classification accuracy

Abstract

Neural networks are not learning optimal decision boundaries. We show that decision boundaries are situated in areas of low training data density. They are impacted by few training samples which can easily lead to overfitting. We provide a simple algorithm performing a weighted average of the prediction of a sample and its nearest neighbors' (computed in latent space) leading to a minor favorable outcomes for a variety of important measures for neural networks. In our evaluation, we employ various self-trained and pre-trained convolutional neural networks to show that our approach improves (i) resistance to label noise, (ii) robustness against adversarial attacks, (iii) classification accuracy, and to some degree even (iv) interpretability. While improvements are not necessarily large in all four areas, our approach is conceptually simple, i.e., improvements come without any…

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 1· strong rejectConfidence 4

Strengths

- The proposal of analyzing the latent space through the lens of nearest neighbors makes sense and can provide some insight on model behavior. - Results reported show small, although noticeable performance improvement over the baseline models.

Weaknesses

- The overall writing and presentation is poor. In particular, it is difficult to make sense of many figures (e.g., Figs. 2, 3 and 5). Algorithm 1 is also quite messy, while not carrying much information. - The claims are broad and not well-supported (see Table 1). For instance, the interpretability claim is not accurate nor properly illustrated with experiments. Being able to identify the nearest neighbors and use them to interpret classification of the current sample is a local and quite weak

Reviewer 02Rating 3· reject, not good enoughConfidence 4

Strengths

1. The proposed method is simple and can be incorporated in any layered neural network used for classification. (originality) 2. The idea to consider the k-nearest neighbors to stabilize the prediction of a neural network is nice. (originality) 3. The measuring of the distances in the latent space where samples of one class should already be sufficiently close due to the feature engineering is good. (originality) 4. The evaluation of the method with respect to 4 different criteria is really good

Weaknesses

Please see for further information Questions where I will explain these points in more detail. 1. some claims/statements in the paper are not sufficiently backed up with results or references. (quality) 2. the proposed method should be better discussed with respect to state of the art. (quality + originality) 3. The improvements are minor (as honestly stated by the authors). In general, IMHO, this is not a problem but the authors could have presented the results in a better way. For instance, t

Reviewer 03Rating 3· reject, not good enoughConfidence 4

Strengths

Paper has a broad and practical view. It proposes a relatively inexpensive algorithm with clear goals. Overall, I find the approach interesting, and the method has the potential to become a useful contribution.

Weaknesses

The main weaknesses, in my view, are the weak set of experiments and the weak literature review. There is also some arbitrariness in the method regarding the choice of layer and other parameters in the algorithm. ------- The models used in the experiments have surprisingly weak accuracies. Paper spends considerable space describing the models, the dataset, and the specific details of how they have been trained. However, the reported testing accuracies are quite low. Paper can download standard

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFuzzy Logic and Control Systems · Neural Networks and Applications