DW-KNN: A Transparent Local Classifier Integrating Distance Consistency and Neighbor Reliability
Kumarjit Pathak, Karthik K, Sachin Madan, and Jitin Kapila

TL;DR
DW-KNN is a transparent, robust local classifier that improves reliability and interpretability by integrating distance and neighbor validity, outperforming standard KNN variants in accuracy and stability.
Contribution
The paper introduces DW-KNN, a novel variant of KNN that combines exponential distance and neighbor validity for enhanced interpretability and robustness in heterogeneous feature spaces.
Findings
Achieves 0.8988 accuracy on average across 9 datasets.
Ranks 2nd among six methods, close to the best ensemble KNN.
Exhibits the lowest cross-validation variance, indicating stable predictions.
Abstract
K-Nearest Neighbors (KNN) is one of the most used ML classifiers. However, if we observe closely, standard distance-weighted KNN and relative variants assume all 'k' neighbors are equally reliable. In heterogeneous feature space, this becomes a limitation that hinders reliability in predicting true levels of the observation. We propose DW-KNN (Double Weighted KNN), a transparent and robust variant that integrates exponential distance with neighbor validity. This enables instance-level interpretability, suppresses noisy or mislabeled samples, and reduces hyperparameter sensitivity. Comprehensive evaluation on 9 data-sets helps to demonstrate that DW-KNN achieves 0.8988 accuracy on average. It ranks 2nd among six methods and within 0.2% of the best-performing Ensemble KNN. It also exhibits the lowest cross-validation variance (0.0156), indicating reliable prediction stability.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Imbalanced Data Classification Techniques
