Optimal PAC-Bayesian Posteriors for Stochastic Classifiers and their use for Choice of SVM Regularization Parameter
Puja Sahu, Nandyala Hemachandra

TL;DR
This paper develops methods to find optimal PAC-Bayesian posteriors for stochastic classifiers, especially SVMs, providing tight risk bounds and improved test errors across multiple datasets, with a focus on computational efficiency.
Contribution
It introduces closed-form solutions and fixed point equations for optimal posteriors using various distance functions, enhancing PAC-Bayesian analysis for classifier regularization.
Findings
Optimal posteriors achieve lowest test errors in most datasets.
Linear distance yields convex optimization with closed-form solutions.
KL-divergence bound is tightest but computationally expensive.
Abstract
PAC-Bayesian set up involves a stochastic classifier characterized by a posterior distribution on a classifier set, offers a high probability bound on its averaged true risk and is robust to the training sample used. For a given posterior, this bound captures the trade off between averaged empirical risk and KL-divergence based model complexity term. Our goal is to identify an optimal posterior with the least PAC-Bayesian bound. We consider a finite classifier set and 5 distance functions: KL-divergence, its Pinsker's and a sixth degree polynomial approximations; linear and squared distances. Linear distance based model results in a convex optimization problem. We obtain closed form expression for its optimal posterior. For uniform prior, this posterior has full support with weights negative-exponentially proportional to number of misclassifications. Squared distance and Pinsker's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Markov Chains and Monte Carlo Methods · Distributed Sensor Networks and Detection Algorithms
MethodsTest · Support Vector Machine
