k-Nearest Neighbors by Means of Sequence to Sequence Deep Neural Networks and Memory Networks
Yiming Xu, Diego Klabjan

TL;DR
This paper introduces sequence-to-sequence and memory network models that mimic k-Nearest Neighbors, outperform traditional models on structured data, and serve as effective oversamplers for imbalanced datasets.
Contribution
It presents novel neural network architectures that replicate kNN behavior and enhance oversampling techniques for imbalanced data.
Findings
Models outperform kNN, neural networks, XGBoost, and random forests on structured datasets.
Performance on image and text data is comparable to state-of-the-art deep models.
Sequence-to-sequence kNN often surpasses SMOTE and ADASYN as an oversampler.
Abstract
k-Nearest Neighbors is one of the most fundamental but effective classification models. In this paper, we propose two families of models built on a sequence to sequence model and a memory network model to mimic the k-Nearest Neighbors model, which generate a sequence of labels, a sequence of out-of-sample feature vectors and a final label for classification, and thus they could also function as oversamplers. We also propose 'out-of-core' versions of our models which assume that only a small portion of data can be loaded into memory. Computational experiments show that our models on structured datasets outperform k-Nearest Neighbors, a feed-forward neural network, XGBoost, lightGBM, random forest and a memory network, due to the fact that our models must produce additional output and not just the label. On image and text datasets, the performance of our model is close to many…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Imbalanced Data Classification Techniques · Domain Adaptation and Few-Shot Learning
MethodsMemory Network
