Correcting FLORES Evaluation Dataset for Four African Languages
Idris Abdulmumin, Sthembiso Mkhwanazi, Mahlatse S. Mbooi, Shamsuddeen, Hassan Muhammad, Ibrahim Said Ahmad, Neo Putini, Miehleketo Mathebula,, Matimba Shingange, Tajuddeen Gwadabe, Vukosi Marivate

TL;DR
This paper corrects and improves the FLORES evaluation dataset for four African languages by addressing inaccuracies through native speaker review, enhancing its reliability for NLP task evaluation.
Contribution
It introduces a meticulous correction process for the FLORES dataset involving native speakers, improving data quality for low-resource language evaluation.
Findings
Enhanced dataset accuracy and reliability
Statistical validation of corrections
Recommendations for future low-resource language data collection
Abstract
This paper describes the corrections made to the FLORES evaluation (dev and devtest) dataset for four African languages, namely Hausa, Northern Sotho (Sepedi), Xitsonga, and isiZulu. The original dataset, though groundbreaking in its coverage of low-resource languages, exhibited various inconsistencies and inaccuracies in the reviewed languages that could potentially hinder the integrity of the evaluation of downstream tasks in natural language processing (NLP), especially machine translation. Through a meticulous review process by native speakers, several corrections were identified and implemented, improving the overall quality and reliability of the dataset. For each language, we provide a concise summary of the errors encountered and corrected and also present some statistical analysis that measures the difference between the existing and corrected datasets. We believe that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
