Contrastive Learning for Character Detection in Ancient Greek Papyri
Vedasri Nakka, Andreas Fischer, Rolf Ingold, Lars Vogtlin

TL;DR
This study evaluates contrastive learning using SimCLR for ancient Greek letter recognition, finding traditional supervised models outperform SimCLR, likely due to semantic shifts caused by cropping strategies.
Contribution
It provides a comprehensive comparison of contrastive learning versus traditional methods in ancient Greek character detection, highlighting limitations of SimCLR in this context.
Findings
SimCLR does not outperform baseline models in letter recognition.
Traditional supervised models with cross-entropy loss perform better.
Cropping strategies in SimCLR may cause semantic shifts reducing effectiveness.
Abstract
This thesis investigates the effectiveness of SimCLR, a contrastive learning technique, in Greek letter recognition, focusing on the impact of various augmentation techniques. We pretrain the SimCLR backbone using the Alpub dataset (pretraining dataset) and fine-tune it on a smaller ICDAR dataset (finetuning dataset) to compare SimCLR's performance against traditional baseline models, which use cross-entropy and triplet loss functions. Additionally, we explore the role of different data augmentation strategies, essential for the SimCLR training process. Methodologically, we examine three primary approaches: (1) a baseline model using cross-entropy loss, (2) a triplet embedding model with a classification layer, and (3) a SimCLR pretrained model with a classification layer. Initially, we train the baseline, triplet, and SimCLR models using 93 augmentations on ResNet-18 and ResNet-50…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHistorical and Linguistic Studies · Families in Therapy and Culture
MethodsBitcoin Customer Service Number +1-833-534-1729 · Kaiming Initialization · Max Pooling · Convolution · Average Pooling · Global Average Pooling · Dense Connections · Normalized Temperature-scaled Cross Entropy Loss · Feedforward Network · Color Jitter
