# Exploring Confidence Measures for Word Spotting in Heterogeneous   Datasets

**Authors:** Fabian Wolf, Philipp Oberdiek, Gernot A. Fink

arXiv: 1903.10930 · 2019-03-27

## TL;DR

This paper investigates various confidence measures for CNN-based word spotting in diverse datasets, aiming to improve retrieval reliability and understand the relation between confidence and attribute estimation quality.

## Contribution

It introduces and compares four confidence metrics for CNN word spotting, addressing overconfidence issues and dataset reliability assessment.

## Key findings

- Certain confidence measures correlate with attribute estimation quality.
- Confidence metrics can filter unreliable retrieval candidates.
- The approach enhances understanding of CNN reliability in document analysis.

## Abstract

In recent years, convolutional neural networks (CNNs) took over the field of document analysis and they became the predominant model for word spotting. Especially attribute CNNs, which learn the mapping between a word image and an attribute representation, showed exceptional performances. The drawback of this approach is the overconfidence of neural networks when used out of their training distribution. In this paper, we explore different metrics for quantifying the confidence of a CNN in its predictions, specifically on the retrieval problem of word spotting. With these confidence measures, we limit the inability of a retrieval list to reject certain candidates. We investigate four different approaches that are either based on the network's attribute estimations or make use of a surrogate model. Our approach also aims at answering the question for which part of a dataset the retrieval system gives reliable results. We further show that there exists a direct relation between the proposed confidence measures and the quality of an estimated attribute representation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.10930/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1903.10930/full.md

## References

14 references — full list in the complete paper: https://tomesphere.com/paper/1903.10930/full.md

---
Source: https://tomesphere.com/paper/1903.10930