Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge Distillation
Kevin Krahn, Derrick Tate, Andrew C. Lamicela

TL;DR
This paper presents a multilingual knowledge distillation method to develop high-quality sentence embedding models for Ancient Greek, leveraging small parallel datasets and aligning with English translations to enable semantic tasks.
Contribution
It introduces a novel distillation approach and dataset creation for Ancient Greek sentence embeddings, overcoming data scarcity challenges.
Findings
Models perform well on translation search tasks
Effective in semantic similarity and retrieval tasks
Dataset and models are publicly available
Abstract
Contextual language models have been trained on Classical languages, including Ancient Greek and Latin, for tasks such as lemmatization, morphological tagging, part of speech tagging, authorship attribution, and detection of scribal errors. However, high-quality sentence embedding models for these historical languages are significantly more difficult to achieve due to the lack of training data. In this work, we use a multilingual knowledge distillation approach to train BERT models to produce sentence embeddings for Ancient Greek text. The state-of-the-art sentence embedding approaches for high-resource languages use massive datasets, but our distillation approach allows our Ancient Greek models to inherit the properties of these models while using a relatively small amount of translated sentence data. We build a parallel sentence dataset using a sentence-embedding alignment method to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsMulti-Head Attention · Attention Is All You Need · Residual Connection · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Attention Dropout · WordPiece · Softmax · Dense Connections
