Improving classifier decision boundaries using nearest neighbors
Johannes Schneider

TL;DR
This paper introduces a simple nearest neighbors-based method to improve neural network decision boundaries, enhancing robustness, accuracy, and interpretability without altering the model architecture or training process.
Contribution
The authors propose a straightforward algorithm that averages predictions with nearest neighbors in latent space, improving multiple neural network performance metrics.
Findings
Enhanced resistance to label noise
Improved robustness against adversarial attacks
Increased classification accuracy
Abstract
Neural networks are not learning optimal decision boundaries. We show that decision boundaries are situated in areas of low training data density. They are impacted by few training samples which can easily lead to overfitting. We provide a simple algorithm performing a weighted average of the prediction of a sample and its nearest neighbors' (computed in latent space) leading to a minor favorable outcomes for a variety of important measures for neural networks. In our evaluation, we employ various self-trained and pre-trained convolutional neural networks to show that our approach improves (i) resistance to label noise, (ii) robustness against adversarial attacks, (iii) classification accuracy, and to some degree even (iv) interpretability. While improvements are not necessarily large in all four areas, our approach is conceptually simple, i.e., improvements come without any…
Peer Reviews
Decision·Submitted to ICLR 2024
- The proposal of analyzing the latent space through the lens of nearest neighbors makes sense and can provide some insight on model behavior. - Results reported show small, although noticeable performance improvement over the baseline models.
- The overall writing and presentation is poor. In particular, it is difficult to make sense of many figures (e.g., Figs. 2, 3 and 5). Algorithm 1 is also quite messy, while not carrying much information. - The claims are broad and not well-supported (see Table 1). For instance, the interpretability claim is not accurate nor properly illustrated with experiments. Being able to identify the nearest neighbors and use them to interpret classification of the current sample is a local and quite weak
1. The proposed method is simple and can be incorporated in any layered neural network used for classification. (originality) 2. The idea to consider the k-nearest neighbors to stabilize the prediction of a neural network is nice. (originality) 3. The measuring of the distances in the latent space where samples of one class should already be sufficiently close due to the feature engineering is good. (originality) 4. The evaluation of the method with respect to 4 different criteria is really good
Please see for further information Questions where I will explain these points in more detail. 1. some claims/statements in the paper are not sufficiently backed up with results or references. (quality) 2. the proposed method should be better discussed with respect to state of the art. (quality + originality) 3. The improvements are minor (as honestly stated by the authors). In general, IMHO, this is not a problem but the authors could have presented the results in a better way. For instance, t
Paper has a broad and practical view. It proposes a relatively inexpensive algorithm with clear goals. Overall, I find the approach interesting, and the method has the potential to become a useful contribution.
The main weaknesses, in my view, are the weak set of experiments and the weak literature review. There is also some arbitrariness in the method regarding the choice of layer and other parameters in the algorithm. ------- The models used in the experiments have surprisingly weak accuracies. Paper spends considerable space describing the models, the dataset, and the specific details of how they have been trained. However, the reported testing accuracies are quite low. Paper can download standard
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFuzzy Logic and Control Systems · Neural Networks and Applications
