Bayes Classification using an approximation to the Joint Probability Distribution of the Attributes
Patrick Hosein, Kevin Baboolal

TL;DR
This paper introduces a novel Bayesian classification method that estimates joint attribute distributions using local neighborhood information, effectively handling zero-frequency issues and attribute dependencies, outperforming traditional approaches.
Contribution
It proposes a new Bayesian classifier that considers attribute dependencies and local neighborhood information, improving over Gaussian and Laplace smoothing methods.
Findings
The proposed method outperforms standard classifiers on UCI datasets.
It effectively addresses the zero-frequency problem without assuming attribute independence.
The approach is simple, robust, and competitive with k-NN classifier.
Abstract
The Naive-Bayes classifier is widely used due to its simplicity, speed and accuracy. However this approach fails when, for at least one attribute value in a test sample, there are no corresponding training samples with that attribute value. This is known as the zero frequency problem and is typically addressed using Laplace Smoothing. However, Laplace Smoothing does not take into account the statistical characteristics of the neighbourhood of the attribute values of the test sample. Gaussian Naive Bayes addresses this but the resulting Gaussian model is formed from global information. We instead propose an approach that estimates conditional probabilities using information in the neighbourhood of the test sample. In this case we no longer need to make the assumption of independence of attribute values and hence consider the joint probability distribution conditioned on the given class…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy and Chemometric Analyses · Fault Detection and Control Systems · Advanced Statistical Methods and Models
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
