TL;DR
Dolphin-CN-Dialect is a streaming-capable ASR model optimized for Chinese dialects, featuring improved data handling, tokenization, and sampling strategies, achieving high accuracy and efficiency in dialect recognition.
Contribution
The paper introduces a novel temperature-based sampling strategy and a redesigned tokenizer tailored for Chinese dialects, enhancing dialect recognition and model performance.
Findings
Significant gains in dialect recognition accuracy.
Reduced CER compared to previous models.
Achieves competitive performance with state-of-the-art ASR models.
Abstract
We present Dolphin-CN-Dialect, a streaming-capable ASR model with a focus on Chinese and dialect-rich scenarios. Compared to the previous version, Dolphin-CN-Dialect introduces substantial improvements in data processing, tokenization, training stability, and data sampling strategies. To address the challenges of highly imbalanced dialect data, we propose a temperature-based sampling strategy that effectively balances standard Mandarin and low-resource dialects, leading to significant gains in dialect recognition performance. In addition, we redesign the tokenizer to better align with linguistic characteristics, adopting character-level modeling for Chinese and subword modeling for English, while introducing extensible dialect tokens. Experimental results show that Dolphin-CN-Dialect achieves improvement in dialect recognition accuracy and CER reduction compared to Dolphin. Furthermore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗DataoceanAI1/dolphi-cn-dialect-smallmodel· ♡ 3♡ 3
- 🤗DataoceanAI1/dolphin-cn-dialect-small-streamingmodel· ♡ 2♡ 2
- 🤗DataoceanAI1/dolphin-cn-dialect-small-promptmodel· ♡ 1♡ 1
- 🤗DataoceanAI1/dolphin-cn-dialect-basemodel· ♡ 1♡ 1
- 🤗DataoceanAI1/dolphin-cn-dialect-base-streamingmodel· ♡ 1♡ 1
- 🤗DataoceanAI/dolphi-cn-dialect-smallmodel
- 🤗DataoceanAI/dolphin-cn-dialect-base-streamingmodel
- 🤗DataoceanAI/dolphin-cn-dialect-small-promptmodel
- 🤗DataoceanAI/dolphin-cn-dialect-small-streamingmodel
- 🤗DataoceanAI/dolphin-cn-dialect-basemodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
