Enhancing Speech Quality through the Integration of BGRU and Transformer Architectures
Souliman Alghnam, Mohammad Alhussien, Khaled Shaheen

TL;DR
This paper presents a hybrid BGRU-Transformer model that significantly improves speech quality in noisy environments by capturing temporal dependencies and complex signal patterns more effectively than traditional or standalone models.
Contribution
It introduces a novel integrated BGRU-Transformer architecture for speech enhancement, demonstrating superior performance over existing methods.
Findings
Outperforms traditional speech enhancement models
Achieves significant noise reduction and speech quality improvement
Enhances robustness and potential for real-world applications
Abstract
Speech enhancement plays an essential role in improving the quality of speech signals in noisy environments. This paper investigates the efficacy of integrating Bidirectional Gated Recurrent Units (BGRU) and Transformer models for speech enhancement tasks. Through a comprehensive experimental evaluation, our study demonstrates the superiority of this hybrid architecture over traditional methods and standalone models. The combined BGRU-Transformer framework excels in capturing temporal dependencies and learning complex signal patterns, leading to enhanced noise reduction and improved speech quality. Results show significant performance gains compared to existing approaches, highlighting the potential of this integrated model in real-world applications. The seamless integration of BGRU and Transformer architectures not only enhances system robustness but also opens the road for advanced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques
MethodsAttention Is All You Need · Absolute Position Encodings · Dense Connections · Linear Layer · Layer Normalization · Byte Pair Encoding · Residual Connection · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer
