Improving CNN classifiers by estimating test-time priors

Milan Sulc; Jiri Matas

arXiv:1805.08235·cs.CV·April 10, 2019

Improving CNN classifiers by estimating test-time priors

Milan Sulc, Jiri Matas

PDF

TL;DR

This paper introduces a Maximum a Posteriori approach for estimating test-time class priors in CNN classifiers, significantly improving accuracy in fine-grained and unknown prior scenarios.

Contribution

It proposes a novel MAP estimation method with a Dirichlet hyper-prior, enhancing stability and accuracy over existing MLE methods for test-time prior estimation.

Findings

01

Significant accuracy improvements on fine-grained datasets.

02

State-of-the-art results on PlantCLEF with unknown priors.

03

MAP estimation outperforms MLE in accuracy, especially with unknown priors.

Abstract

The problem of different training and test set class priors is addressed in the context of CNN classifiers. We compare two different approaches to estimating the new priors: an existing Maximum Likelihood Estimation approach (optimized by an EM algorithm or by projected gradient descend) and a proposed Maximum a Posteriori approach, which increases the stability of the estimate by introducing a Dirichlet hyper-prior on the class prior probabilities. Experimental results show a significant improvement on the fine-grained classification tasks using known evaluation-time priors, increasing the top-1 accuracy by 4.0% on the FGVC iNaturalist 2018 validation set and by 3.9% on the FGVCx Fungi 2018 validation set. Estimation of the unknown test set priors noticeably increases the accuracy on the PlantCLEF dataset, allowing a single CNN model to achieve state-of-the-art results and outperform…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.