Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced Languages

Onno Eberhard; Torsten Zesch

arXiv:2102.04097 · cs.CL · October 6, 2022

TL;DR

This study explores how freezing layers in a speech recognition model affects transfer learning performance for under-resourced languages, demonstrating that even minimal freezing can significantly improve results.

Contribution

It systematically evaluates layer freezing schemes in transfer learning for speech recognition, highlighting the benefits of partial freezing in low-resource scenarios.

Findings

01 Freezing one layer significantly improves transfer performance.

02 Layer freezing schemes outperform training from scratch.

03 Partial freezing is effective for under-resourced languages.

Abstract

In this paper, we investigate the effect of layer freezing on the effectiveness of model transfer in the area of automatic speech recognition. We experiment with Mozilla's DeepSpeech architecture on German and Swiss German speech datasets and compare training from scratch with transferring a pre-trained model. We compare different layer freezing schemes and find that freezing even a single layer significantly improves results.
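
To make the idea of a layer freezing scheme concrete, here is a minimal sketch in plain Python. It is not the authors' code and does not use DeepSpeech or TensorFlow: the layer names, the toy weights, and the helpers `frozen_layers` and `sgd_step` are all hypothetical, and the choice to freeze the layers closest to the input is an assumption made for illustration. The sketch only shows the mechanism: frozen layers keep their pre-trained weights, while the remaining layers are updated on the new language.

```python
from typing import Dict, List, Set


def frozen_layers(layer_names: List[str], num_frozen: int) -> Set[str]:
    """Return the names of the first `num_frozen` layers.

    Assumption for illustration: freezing starts at the input side.
    """
    if not 0 <= num_frozen <= len(layer_names):
        raise ValueError("num_frozen must be between 0 and the number of layers")
    return set(layer_names[:num_frozen])


def sgd_step(
    params: Dict[str, List[float]],
    grads: Dict[str, List[float]],
    frozen: Set[str],
    lr: float = 0.1,
) -> Dict[str, List[float]]:
    """Apply one gradient descent update, skipping every frozen layer."""
    updated = {}
    for name, weights in params.items():
        if name in frozen:
            # Frozen layer: keep the transferred (pre-trained) weights as they are.
            updated[name] = list(weights)
        else:
            # Trainable layer: move the weights against the gradient.
            updated[name] = [w - lr * g for w, g in zip(weights, grads[name])]
    return updated


# Hypothetical three-layer model with toy weights and gradients.
layers = ["layer_1", "layer_2", "layer_3"]
params = {name: [1.0, 1.0] for name in layers}
grads = {name: [0.5, -0.5] for name in layers}

# Freeze only one layer, the smallest non-trivial scheme.
frozen = frozen_layers(layers, num_frozen=1)
new_params = sgd_step(params, grads, frozen, lr=0.1)

print(sorted(frozen))
print(new_params["layer_1"] == params["layer_1"])  # frozen layer is unchanged
print(new_params["layer_2"] != params["layer_2"])  # trainable layer has moved
```

Setting `num_frozen=0` in this sketch corresponds to fine-tuning every layer of the transferred model, and larger values correspond to freezing more of it. In a real framework the same effect is usually achieved by marking layers as non-trainable rather than by filtering updates by hand.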

Code & Models

Repositories

onnoeberhard/deepspeech · tf · Official

Taxonomy

Topics: Speech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling