Evaluation of Error Probability of Classification Based on the Analysis of the Bayes Code: Extension and Example

Shota Saito; Toshiyasu Matsushima

arXiv:1910.03257 · cs.IT · May 4, 2021 · 1 citation

Open Access

TL;DR

This paper extends the authors' earlier analysis of classification error probability based on the Bayes code: it removes two restrictions on the prior distributions, generalizes the resulting bounds, and provides numerical calculations at finite blocklength for specific models.

Contribution

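It generalizes the previous bounds on classification error by dropping the assumptions that $θ$ and $ξ$ share the same prior distribution and that the two hypotheses have a uniform prior, and it adds numerical analysis at finite blocklength.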

Findings

1. More general error bounds for classification with the Bayes code

2. Numerical results for specific models at finite blocklength

3. Enhanced understanding of error probabilities in hypothesis testing

Abstract

Suppose that we have two training sequences generated by parametrized distributions $P_{θ^{*}}$ and $P_{ξ^{*}}$, where $θ^{*}$ and $ξ^{*}$ are unknown true parameters. Given the training sequences, we study the problem of classifying whether a test sequence was generated according to $P_{θ^{*}}$ or $P_{ξ^{*}}$. This problem can be thought of as a hypothesis testing problem, and our aim is to analyze the weighted sum of type-I and type-II error probabilities. Utilizing the analysis of the codeword lengths of the Bayes code, our previous study derived more refined bounds on the error probability than were previously known. However, our previous study had the following deficiencies: i) the prior distributions of $θ$ and $ξ$ are the same; ii) the prior distributions of the two hypotheses are uniform; iii) no numerical calculation at finite blocklength. This study solves these problems. We…
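The setup in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper: it assumes Bernoulli sources, Beta priors on $θ$ and $ξ$ (allowed to differ), and a decision rule that picks the hypothesis with the shorter Bayes codeword length for the test sequence given the training sequence, offset by the log prior of the hypothesis. All function names and parameter values are hypothetical, and the paper's actual models and bounds may differ.

```python
import math
import random


def log_beta(a, b):
    """Natural log of the Beta function B(a, b)."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def bayes_code_length(test, train, a=0.5, b=0.5):
    """Codeword length (in nats) of `test` given `train` under a
    Beta(a, b)-Bernoulli mixture. The Bernoulli model and Beta prior
    are assumptions made for this illustration only."""
    k_tr, n_tr = sum(train), len(train)
    k_te, n_te = sum(test), len(test)
    a_post, b_post = a + k_tr, b + (n_tr - k_tr)  # posterior after training
    log_pred = log_beta(a_post + k_te, b_post + n_te - k_te) - log_beta(a_post, b_post)
    return -log_pred


def classify(test, train1, train2, prior1=0.5,
             prior_theta=(0.5, 0.5), prior_xi=(0.5, 0.5)):
    """Return 1 or 2: the hypothesis with the smaller penalized code length.
    Adding -log(prior) is an assumed way to handle non-uniform hypothesis priors."""
    score1 = bayes_code_length(test, train1, *prior_theta) - math.log(prior1)
    score2 = bayes_code_length(test, train2, *prior_xi) - math.log(1.0 - prior1)
    return 1 if score1 <= score2 else 2


def estimate_weighted_error(theta, xi, n_train, n_test,
                            prior1=0.5, trials=2000, seed=0):
    """Monte Carlo estimate of prior1 * (type-I error) + (1 - prior1) * (type-II error)."""
    rng = random.Random(seed)

    def draw(p, n):
        return [1 if rng.random() < p else 0 for _ in range(n)]

    type1 = type2 = 0
    for _ in range(trials):
        tr1, tr2 = draw(theta, n_train), draw(xi, n_train)
        if classify(draw(theta, n_test), tr1, tr2, prior1) != 1:
            type1 += 1  # test came from P_theta but was assigned to P_xi
        if classify(draw(xi, n_test), tr1, tr2, prior1) != 2:
            type2 += 1  # test came from P_xi but was assigned to P_theta
    return prior1 * type1 / trials + (1.0 - prior1) * type2 / trials
```

Calling, for example, `estimate_weighted_error(0.6, 0.4, n_train=100, n_test=50, prior1=0.3)` gives a simulated weighted error at one finite blocklength; such an estimate is only a rough empirical counterpart to the analytical bounds the abstract refers to.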

Taxonomy

Topics: Machine Learning and Data Classification · Machine Learning and Algorithms · Imbalanced Data Classification Techniques