Enabling Interactive Transcription in an Indigenous Community
\'Eric Le Ferrand, Steven Bird, Laurent Besacier

TL;DR
This paper introduces a new transcription workflow that combines spoken term detection and human-in-the-loop methods to bootstrap transcription of endangered languages with minimal initial data.
Contribution
It presents a novel approach for early-stage transcription of endangered languages using minimal resources and a pilot experiment demonstrating its effectiveness.
Findings
Bootstrapping transcription from few isolated words is feasible.
The workflow improves transcription accuracy in low-resource scenarios.
Early-stage transcription can be initiated with minimal initial data.
Abstract
We propose a novel transcription workflow which combines spoken term detection and human-in-the-loop, together with a pilot experiment. This work is grounded in an almost zero-resource scenario where only a few terms have so far been identified, involving two endangered languages. We show that in the early stages of transcription, when the available data is insufficient to train a robust ASR system, it is possible to take advantage of the transcription of a small number of isolated words in order to bootstrap the transcription of a speech collection.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Natural Language Processing Techniques · Speech Recognition and Synthesis
