Incorporating Crowdsourced Annotator Distributions into Ensemble   Modeling to Improve Classification Trustworthiness for Ancient Greek Papyri

Graham West; Matthew I. Swindall; Ben Keener; Timothy Player; Alex C.; Williams; James H. Brusuelas; John F. Wallin

arXiv:2210.16380·cs.CV·February 14, 2024

Incorporating Crowdsourced Annotator Distributions into Ensemble Modeling to Improve Classification Trustworthiness for Ancient Greek Papyri

Graham West, Matthew I. Swindall, Ben Keener, Timothy Player, Alex C., Williams, James H. Brusuelas, John F. Wallin

PDF

Open Access

TL;DR

This paper enhances classification trustworthiness for ancient Greek papyri by incorporating crowdsourced annotator distributions into ensemble models, improving accuracy and uncertainty estimation.

Contribution

It introduces a novel ensemble approach that integrates crowdsourced annotation distributions, boosting accuracy and enabling better uncertainty quantification in noisy datasets.

Findings

01

Ensemble model achieves over 95% accuracy, surpassing individual ResNets.

02

Entropy analysis effectively predicts model misclassifications.

03

Crowdsourced annotation distributions improve trustworthiness in noisy data.

Abstract

Performing classification on noisy, crowdsourced image datasets can prove challenging even for the best neural networks. Two issues which complicate the problem on such datasets are class imbalance and ground-truth uncertainty in labeling. The AL-ALL and AL-PUB datasets - consisting of tightly cropped, individual characters from images of ancient Greek papyri - are strongly affected by both issues. The application of ensemble modeling to such datasets can help identify images where the ground-truth is questionable and quantify the trustworthiness of those samples. As such, we apply stacked generalization consisting of nearly identical ResNets with different loss functions: one utilizing sparse cross-entropy (CXE) and the other Kullback-Liebler Divergence (KLD). Both networks use labels drawn from a crowd-sourced consensus. This consensus is derived from a Normalized Distribution of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Data Stream Mining Techniques

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · 1x1 Convolution · Batch Normalization · Global Average Pooling · Kaiming Initialization · Max Pooling · Residual Connection · Bottleneck Residual Block · Residual Block