Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system
Keisuke Kinoshita, Marc Delcroix, Shoko Araki, Tomohiro Nakatani

TL;DR
This paper demonstrates that an all-neural system for source separation, counting, and diarization can effectively handle real noisy, reverberant meetings, outperforming existing state-of-the-art methods in challenging scenarios.
Contribution
The study extends the all-neural approach to real-world meeting data, improving robustness and demonstrating superior performance over traditional systems.
Findings
Effective speech enhancement in real meeting conditions
Outperforms state-of-the-art systems in noisy, reverberant environments
Robustness improvements enable practical deployment
Abstract
Automatic meeting analysis is an essential fundamental technology required to let, e.g. smart devices follow and respond to our conversations. To achieve an optimal automatic meeting analysis, we previously proposed an all-neural approach that jointly solves source separation, speaker diarization and source counting problems in an optimal way (in a sense that all the 3 tasks can be jointly optimized through error back-propagation). It was shown that the method could well handle simulated clean (noiseless and anechoic) dialog-like data, and achieved very good performance in comparison with several conventional methods. However, it was not clear whether such all-neural approach would be successfully generalized to more complicated real meeting data containing more spontaneously-speaking speakers, severe noise and reverberation, and how it performs in comparison with the state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Indoor and Outdoor Localization Technologies
