CoMFLP: Correlation Measure based Fast Search on ASR Layer Pruning
Wei Liu, Zhiyuan Peng, Tan Lee

TL;DR
This paper introduces CoMFLP, a fast layer pruning method for speech recognition models that uses correlation measures to efficiently identify redundant layers, significantly reducing search time while maintaining or improving performance.
Contribution
The paper proposes a novel correlation-based fast search algorithm for layer pruning in ASR models, reducing complexity from time-consuming evaluations to constant time.
Findings
Outperforms existing layer pruning methods in accuracy.
Requires only constant time complexity for pruning.
Effective on speech recognition tasks.
Abstract
Transformer-based speech recognition (ASR) model with deep layers exhibited significant performance improvement. However, the model is inefficient for deployment on resource-constrained devices. Layer pruning (LP) is a commonly used compression method to remove redundant layers. Previous studies on LP usually identify the redundant layers according to a task-specific evaluation metric. They are time-consuming for models with a large number of layers, even in a greedy search manner. To address this problem, we propose CoMFLP, a fast search LP algorithm based on correlation measure. The correlation between layers is computed to generate a correlation matrix, which identifies the redundancy among layers. The search process is carried out in two steps: (1) coarse search: to determine top candidates by pruning the most redundant layers based on the correlation matrix; (2) fine search: to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
MethodsPruning
