Efficient Estimation of the number of neighbours in Probabilistic K Nearest Neighbour Classification
Ji Won Yoon, Nial Friel

TL;DR
This paper introduces a Bayesian approach with a new approximation algorithm to estimate the optimal number of neighbors in Probabilistic KNN, improving classification accuracy without extensive computation.
Contribution
It presents a novel functional approximation algorithm for Bayesian model averaging of the neighbor count, avoiding Monte Carlo simulations and cross-validation.
Findings
Good performance on real datasets
Effective uncertainty modeling in neighbor selection
Avoids computationally intensive methods
Abstract
Probabilistic k-nearest neighbour (PKNN) classification has been introduced to improve the performance of original k-nearest neighbour (KNN) classification algorithm by explicitly modelling uncertainty in the classification of each feature vector. However, an issue common to both KNN and PKNN is to select the optimal number of neighbours, . The contribution of this paper is to incorporate the uncertainty in into the decision making, and in so doing use Bayesian model averaging to provide improved classification. Indeed the problem of assessing the uncertainty in can be viewed as one of statistical model selection which is one of the most important technical issues in the statistics and machine learning domain. In this paper, a new functional approximation algorithm is proposed to reconstruct the density of the model (order) without relying on time consuming Monte Carlo…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Bayesian Methods and Mixture Models
