MyVoice: Arabic Speech Resource Collaboration Platform
Yousseif Elshahawy, Yassine El Kheir, Shammur Absar Chowdhury, and, Ahmed Ali

TL;DR
MyVoice is a collaborative crowdsourcing platform for collecting and validating large-scale Arabic speech datasets, focusing on dialectal diversity and quality assurance to support speech technology development.
Contribution
It introduces a flexible, role-switching platform with integrated quality control for creating and sharing Arabic speech resources, enhancing dialectal speech technology research.
Findings
Successful collection of diverse Arabic speech data
Effective quality assurance filtering system
Facilitated community collaboration in speech data gathering
Abstract
We introduce MyVoice, a crowdsourcing platform designed to collect Arabic speech to enhance dialectal speech technologies. This platform offers an opportunity to design large dialectal speech datasets; and makes them publicly available. MyVoice allows contributors to select city/country-level fine-grained dialect and record the displayed utterances. Users can switch roles between contributors and annotators. The platform incorporates a quality assurance system that filters out low-quality and spurious recordings before sending them for validation. During the validation phase, contributors can assess the quality of recordings, annotate them, and provide feedback which is then reviewed by administrators. Furthermore, the platform offers flexibility to admin roles to add new data or tasks beyond dialectal speech and word collection, which are displayed to contributors. Thus, enabling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Speech Recognition and Synthesis · Natural Language Processing Techniques
