The ITU Faroese Pairs Dataset
Leon Derczynski, Annika Solveig Hedegaard Isfeldt, Signhild Djurhuus

TL;DR
This paper introduces the ITU Faroese Pairs Dataset, a bilingual sentence pair dataset for Faroese and Danish, aimed at improving machine translation systems for these languages.
Contribution
It provides a new, publicly available dataset of Faroese-Danish sentence pairs for training and evaluating machine translation models.
Findings
Dataset includes sentence pairs in both translation directions.
Facilitates development of Faroese-Danish machine translation.
Supports research in low-resource language translation.
Abstract
This article documents a dataset of sentence pairs between Faroese and Danish, produced at ITU Copenhagen. The data covers tranlsation from both source languages, and is intended for use as training data for machine translation systems in this language pair.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
