Darts-Conformer: Towards Efficient Gradient-Based Neural Architecture Search For End-to-End ASR
Xian Shi, Pan Zhou, Wei Chen, Lei Xie

TL;DR
This paper introduces Darts-Conformer, an efficient neural architecture search method for end-to-end ASR that automatically finds high-performing Conformer-based models with significantly reduced computational cost.
Contribution
It applies differentiable architecture search to Conformer models for ASR, achieving automatic architecture optimization with low computational overhead.
Findings
Search process costs only 0.7 GPU days
Achieves 4.7% relative WER reduction on AISHELL-1
Demonstrates transferability to larger datasets
Abstract
Neural architecture search (NAS) has been successfully applied to tasks like image classification and language modeling for finding efficient high-performance network architectures. In ASR field especially end-to-end ASR, the related research is still in its infancy. In this work, we focus on applying NAS on the most popular manually designed model: Conformer, and then propose an efficient ASR model searching method that benefits from the natural advantage of differentiable architecture search (Darts) in reducing computational overheads. We fuse Darts mutator and Conformer blocks to form a complete search space, within which a modified architecture called Darts-Conformer cell is found automatically. The entire searching process on AISHELL-1 dataset costs only 0.7 GPU days. Replacing the Conformer encoder by stacking searched cell, we get an end-to-end ASR model (named as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Topic Modeling · Multimodal Machine Learning Applications
