Supporting SENCOTEN Language Documentation Efforts with Automatic Speech Recognition

Mengzhe Geng; Patrick Littell; Aidan Pine; PEN\'A\'C; Marc Tessier; Roland Kuhn

arXiv:2507.10827·cs.SD·July 22, 2025

Supporting SENCOTEN Language Documentation Efforts with Automatic Speech Recognition

Mengzhe Geng, Patrick Littell, Aidan Pine, PEN\'A\'C, Marc Tessier, Roland Kuhn

PDF

Open Access

TL;DR

This paper presents an ASR-based pipeline utilizing augmented data and transfer learning to support SENCOTEN language documentation, achieving promising accuracy despite limited data and complex linguistic features.

Contribution

It introduces a novel ASR-driven documentation pipeline combining augmented speech data, cross-lingual transfer, and language modeling for low-resource polysynthetic languages.

Findings

01

Achieved 19.34% WER and 5.09% CER on SENCOTEN test data.

02

Improved WER to 14.32% after filtering minor errors.

03

Demonstrated potential for supporting language revitalization efforts.

Abstract

The SENCOTEN language, spoken on the Saanich peninsula of southern Vancouver Island, is in the midst of vigorous language revitalization efforts to turn the tide of language loss as a result of colonial language policies. To support these on-the-ground efforts, the community is turning to digital technology. Automatic Speech Recognition (ASR) technology holds great promise for accelerating language documentation and the creation of educational resources. However, developing ASR systems for SENCOTEN is challenging due to limited data and significant vocabulary variation from its polysynthetic structure and stress-driven metathesis. To address these challenges, we propose an ASR-driven documentation pipeline that leverages augmented speech data from a text-to-speech (TTS) system and cross-lingual transfer learning with Speech Foundation Models (SFMs). An n-gram language model is also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Phonetics and Phonology Research